Input apparatus, input method of input apparatus, and output apparatus

ABSTRACT

An input apparatus for inputting a diagnosis result of a diagnosis target detectable for a structure includes circuitry configured to display a spherical image captured for the structure on a screen, receive an input of a position of the diagnosis target in the spherical image, store position information indicating the received position of the diagnosis target in the spherical image in a memory, display, on the screen, the spherical image and a diagnosis information input field used for inputting diagnosis information of the diagnosis target, receive an input of the diagnosis information of the diagnosis target input via the diagnosis information input field, and store the diagnosis information and the position information indicating the received position of the diagnosis target in the spherical image in the memory in association with each other.

CROSS-REFERENCE TO RELATED APPLICATION

This application claims priority pursuant to 35 U.S.C. § 119(a) to Japanese Patent Application No. 2018-066142, filed on Mar. 29, 2018 in the Japan Patent Office, the disclosure of which is incorporated by reference herein in its entirety.

BACKGROUND

Technical Field

This disclosure relates to an input apparatus, an input method of an input apparatus, and an output apparatus.

Background Art

Field inspections and surveys at various sites, such as building structures, have become important issues. In particular, field inspections and surveys are required in various cases, such as inspections for determining the necessity of repairs, surveys for planning works on construction sites, surveys for renewal of equipment, and surveys for designing constructions. A technique of capturing an image of a target object using an image capture apparatus when a survey is performed on a site and storing the image data of the target object is known, which saves labor in inspection work and facilitates sharing of information on inspection services.

When an inspection is performed on a structure to detect findings, such as a crack, results of the inspection may be required to be reported as an inspection report. When the inspection report is created by a person, such as an inspector, the inspector can search position information associated with positions of findings on a three-dimensional space model. However, since the task of associating the position information of findings on the three-dimensional space model with the information of the findings is performed manually, the report creation becomes complicated work, and a longer period of time is required to create the report.

SUMMARY

As one aspect of the present invention, an input apparatus for inputting a diagnosis result of a diagnosis target detectable for a structure is devised. The input apparatus includes circuitry configured to display a spherical image captured for the structure on a screen, receive an input of a position of the diagnosis target in the spherical image, store position information indicating the received position of the diagnosis target in the spherical image in a memory, display, on the screen, the spherical image and a diagnosis information input field used for inputting diagnosis information of the diagnosis target, receive an input of the diagnosis information of the diagnosis target input via the diagnosis information input field, and store the diagnosis information and the position information indicating the received position of the diagnosis target in the spherical image in the memory in association with each other.

As another aspect of the present invention, a method of inputting a diagnosis result of a diagnosis target detectable for a structure is devised. The method includes displaying a spherical image captured for the structure on a screen, receiving an input of a position of the diagnosis target in the spherical image, storing position information indicating the received position in a memory, displaying, on the screen, the spherical image and a diagnosis information input field used for inputting diagnosis information of the diagnosis target, receiving an input of the diagnosis information of the diagnosis target input via the diagnosis information input field, and storing the diagnosis information and the position information in the memory in association with each other.

As another aspect of the present invention, an output apparatus for outputting a diagnosis result of a diagnosis target detectable for a structure is devised. The output apparatus includes circuitry configured to acquire position information indicating a position of the diagnosis target in a spherical image captured for the structure, and diagnosis information including a diagnosis result of the diagnosis target stored in a memory in association with each other, from the memory, and output the acquired diagnosis information of the diagnosis target based on the position information associated with the diagnosis information.

BRIEF DESCRIPTION OF THE DRAWINGS

A more complete appreciation of the description and many of the attendant advantages and features thereof can be readily obtained and understood from the following detailed description with reference to the accompanying drawings, wherein:

FIG. 1 is an example of an appearance view of an image capture apparatus of a first embodiment;

FIG. 2 illustrates an example of a structure of an image capture unit according to the first embodiment;

FIGS. 3A, 3B and 3C (FIG. 3) are schematic trihedral views (three-sided views) of the appearance of the image capture apparatus according to the first embodiment;

FIG. 4 is an example of a hardware block diagram of the image capture apparatus according to the first embodiment;

FIG. 5 is an example of a diagram illustrating an arrangement of a battery and a circuit unit in the image capture apparatus according to the first embodiment;

FIG. 6 is an example of a hardware block diagram of an information processing apparatus, which can be used as an input apparatus for inputting an annotation according to the first embodiment;

FIG. 7 is an example of a functional block diagram of the information processing apparatus according to the first embodiment;

FIGS. 8A and 8B (FIG. 8) are an example of a diagram for describing how an imaging lens projects three-dimensional incident light into a two-dimensional space according to the first embodiment;

FIG. 9 is a schematic view for describing an inclination of the image capture apparatus according to the first embodiment;

FIGS. 10A and 10B (FIG. 10) are an example of a diagram for describing a format of a spherical image according to the first embodiment;

FIG. 11 is an example of a diagram for describing correspondence between each pixel position on a hemispherical image and each pixel position on a full view spherical image associated by a conversion table according to the first embodiment;

FIGS. 12A and 12B (FIG. 12) are an example of diagrams for describing a vertical correction according to the first embodiment;

FIGS. 13A and 13B (FIG. 13) schematically illustrate examples of configurations of an information processing system according to the first embodiment;

FIG. 14 is an example of a diagram for describing an image capture operation using the image capture apparatus according to the first embodiment;

FIG. 15A is an example of a diagram for describing an automatic estimation function of an image capture position according to the first embodiment;

FIGS. 15B and 15C are an example of diagrams for describing a three-dimensional (3D) panoramic automatic tour function according to the first embodiment;

FIG. 15D is an example of a diagram for describing an input and confirmation function of an annotation according to the first embodiment;

FIGS. 16A and 16B (FIG. 16) are an example of a flowchart illustrating the steps of annotation input processing according to the first embodiment;

FIG. 17 is an example of a diagram for describing an image cutting process according to the first embodiment;

FIG. 18 is an example of a screen displayed using a display device under a control of a user interface (UI) unit according to the first embodiment;

FIG. 19 is an example of a diagram illustrating a floor plan image according to the first embodiment;

FIG. 20 is an example of an annotation input screen displayed on a screen according to the first embodiment;

FIG. 21 is an example of an annotation input screen switched to a region designation screen according to the first embodiment;

FIG. 22 is an example of displaying of a screen when designating a cut region according to the first embodiment;

FIG. 23 is an example of a screen displaying a marker on a cut image displayed in a cut image display field according to the first embodiment;

FIG. 24 is an example of displaying of a screen related to a report data creation according to the first embodiment;

FIG. 25 is an example of report data according to the first embodiment;

FIG. 26 is an example case when a crack is observed on a wall surface according to a variant example of the first embodiment;

FIG. 27 is an example of an annotation input screen for inputting an annotation for a designated crack according to a variant example of the first embodiment;

FIG. 28 is an example of an annotation input screen switched to a region designation screen according to a variant example of the first embodiment;

FIG. 29 is an example of displaying of a screen for designating a cut region according to a variant example of the first embodiment;

FIG. 30 is an example of a schematic view of an image capture apparatus according to a second embodiment;

FIG. 31 illustrates an example of an image capture range that can be captured by each of image capture units according to the second embodiment;

FIG. 32 is an example of a functional block diagram of an information processing apparatus used as an input apparatus for inputting an annotation according to the second embodiment;

FIGS. 33A, 33B, 33C, 33D, 33E, and 33F (FIG. 33) illustrate examples of images that are captured from different viewpoints using the image capture apparatus and synthesized at each of image capture units according to the second embodiment;

FIG. 34 is an example of a flowchart illustrating the steps of creating a three-dimensional reconstruction model according to the second embodiment;

FIG. 35 is an example of a diagram for describing triangular surveying according to the second embodiment;

FIGS. 36A and 36B (FIG. 36) are an example of diagrams for describing the principle of an epipolar plane image (EPI) according to the second embodiment;

FIGS. 37A and 37B (FIG. 37) are another example of diagrams for describing the principle of an epipolar plane image (EPI) according to the second embodiment;

FIG. 38 is an example of a diagram for describing a point where a slope “m” becomes a value based on a curve when an EPI is composed of omnidirectional images according to the second embodiment;

FIG. 39 is an example of another diagram for describing a point where a slope “m” becomes a value based on a curve when an EPI is composed of omnidirectional images according to the second embodiment;

FIG. 40 is an example of a diagram of a plane in which disparity is preferentially calculated based on an image captured by the image capture apparatus according to the second embodiment;

FIG. 41 is an example of a diagram of a large space including large buildings as target objects for creating a three-dimensional reconstruction model according to the second embodiment;

FIG. 42 is an example of a hardware block diagram of the image capture apparatus according to the second embodiment;

FIG. 43 is an example of a hardware block diagram of a control unit and a memory of an image capture apparatus according to the second embodiment;

FIGS. 44A and 44B (FIG. 44) are an example of a diagram for describing a position designation according to the second embodiment; and

FIG. 45 is an example of a diagram for describing a position designation using a line.

The accompanying drawings are intended to depict embodiments of the present invention and should not be interpreted to limit the scope thereof. The accompanying drawings are not to be considered as drawn to scale unless explicitly noted.

DETAILED DESCRIPTION

A description is now given of exemplary embodiments of the present inventions. It should be noted that although such terms as first, second, etc. may be used herein to describe various elements, components, regions, layers and/or units, it should be understood that such elements, components, regions, layers and/or units are not limited thereby because such terms are relative, that is, used only to distinguish one element, component, region, layer or unit from another region, layer or unit. Thus, for example, a first element, component, region, layer or unit discussed below could be termed a second element, component, region, layer or unit without departing from the teachings of the present inventions.

In addition, it should be noted that the terminology used herein is for the purpose of describing particular embodiments only and is not intended to be limiting of the present inventions. Thus, for example, as used herein, the singular forms “a”, “an” and “the” are intended to include the plural forms as well, unless the context clearly indicates otherwise. Moreover, the terms “includes” and/or “including”, when used in this specification, specify the presence of stated features, integers, steps, operations, elements, and/or components, but do not preclude the presence or addition of one or more other features, integers, steps, operations, elements, components, and/or groups thereof.

Hereinafter, a description is given of an input apparatus, an input method of an input apparatus, an output apparatus, and an output method of an output apparatus with reference to the accompanying drawings. In this disclosure, the input apparatus and the output apparatus are described as an example of information processing apparatuses, such as image data processing apparatuses, but not limited thereto.

First Embodiment

As to a first embodiment, a full view spherical image of a diagnosis target object can be captured using an image capture apparatus capable of capturing the full view spherical image, and an annotation can be input for the full view spherical image by designating a position of a target (e.g., diagnosis target) on the full view spherical image. The input annotation is stored in association with position information of the target on the full view spherical image.

By using the full view spherical image to capture an image of the diagnosis target, an image of the diagnosis target can be captured completely, without missing a part of the diagnosis target, by performing a smaller number of image capturing operations. Further, since the annotation can be input by designating a position of the diagnosis target on the full view spherical image, the diagnosis target input with the annotation can be intuitively recognized, and thereby input errors can be reduced. Further, since the input annotation is stored in association with information of the position of the diagnosis target on the full view spherical image, the relationship between the annotation and the diagnosis target input with the annotation can be easily recognized, with which a burden of creating a report can be reduced. Therefore, the occurrence of human error when creating the report can be reduced.

The full view spherical image is an image captured with an angle of 360 degrees on two orthogonal planes (e.g., horizontal plane and vertical plane) and an angle of view of 4π steradians relative to an image capture position. Further, the annotation indicates information based on a result of observing and diagnosing the diagnosis target. The annotation can include one or more items defined in advance, and any comments for each type of diagnosis target. The annotation can further include image data. Further, in this disclosure, the spherical image does not have to be the full-view spherical image covering full 360 degrees in the horizontal direction and/or the vertical direction. For example, the spherical image may be a wide-angle view image having an angle of view anywhere from 180 degrees up to less than 360 degrees in the horizontal direction.

Image Capture Apparatus of First Embodiment:

FIG. 1 is an example of an appearance view of an image capture apparatus 1 a of a first embodiment. As illustrated in FIG. 1, the image capture apparatus 1 a includes, for example, a housing 10 a having a substantially rectangular shape, and imaging lenses 20 a and 20 b, in which the imaging lens 20 a is provided on a first surface of the housing 10 a and the imaging lens 20 b is provided on a second surface of the housing 10 a, at a position opposite to the position of the imaging lens 20 a. The imaging lens can also be referred to as an image capture lens or an image-capturing lens. Further, an image capture element corresponding to each of the imaging lenses 20 a and 20 b is provided in the housing 10 a. Hereinafter, the imaging lenses 20 a and 20 b may be simply referred to as the imaging lens 20.

Light incident on the imaging lens 20 a is irradiated onto the corresponding image capture element via an image-focusing optical system including the imaging lens 20 a provided for the housing 10 a, and light incident on the imaging lens 20 b is irradiated onto the corresponding image capture element via an image-focusing optical system including the imaging lens 20 b provided for the housing 10 a, respectively. The image capture element employs, for example, a charge coupled device (CCD), which is a light receiving element that converts irradiated light into electric charges, but not limited thereto. For example, the image capture element can be a complementary metal oxide semiconductor (CMOS) image sensor.

Although details are to be described later, each drive unit for driving each of the image capture elements performs a shutter control for each of the image capture elements based on a trigger signal, and reads electric charges converted from the light from each of the image capture elements. Each drive unit converts the electric charges received from each of the image capture elements into electrical signals, converts the electrical signals into digital image data, and outputs the digital image data. The digital image data of each captured image output from each drive unit can be stored in, for example, a memory.

In the following description, a configuration including a pair of the imaging lens 20 a and imaging lens 20 b, and the image-focusing optical system and the image capture element corresponding to the set of the imaging lens 20 a and imaging lens 20 b, is described as an image capture unit 21. For example, an operation of outputting captured image data based on light incident on the imaging lenses 20 a and 20 b in accordance with the trigger signal will be described as an image capture operation by the image capture unit 21 for the convenience of description. If the imaging lenses 20 a and 20 b are required to be distinguished from each other, it is described, for example, that the imaging lens 20 a performs the image capture operation.

The image capture apparatus 1 a includes a shutter button 30, which is used for instructing an image capture operation by the image capture unit 21 in response to an operation on the shutter button 30. When the shutter button 30 is operated, the image capture operation using each of the imaging lenses 20 a and 20 b is performed in a synchronized manner in the image capture unit 21.

As illustrated in FIG. 1, the housing 10 a of the image capture apparatus 1 a includes, for example, an image capture portion 2 a, in which the image capture unit 21 is disposed, and an operation portion 3 a, in which the shutter button 30 is disposed. The operation portion 3 a includes, for example, a grip portion 31 for holding the image capture apparatus 1 a by a user. The surface of the grip portion 31 is formed to be less slippery so that the user can securely hold and manipulate the grip portion 31. Further, a fixing portion 32 for fixing the image capture apparatus 1 a to a tripod or the like is provided on a bottom of the grip portion 31.

Hereinafter, a description is given of a structure of the image capture unit 21 with reference to FIG. 2. FIG. 2 illustrates an example of a structure of the image capture unit 21 according to the first embodiment. As illustrated in FIG. 2, the image capture unit 21 includes, for example, image-focusing optical systems 201 a and 201 b including the imaging lenses 20 a and 20 b, and image capture elements 200 a and 200 b using, for example, a charge-coupled device (CCD) sensor or a complementary metal-oxide-semiconductor (CMOS) sensor. Each of the image-focusing optical systems 201 a and 201 b is composed of, for example, seven fisheye lenses arranged in six groups. The fisheye lens has an angle of view of 180 degrees (=360 (deg)/n, where n is an optical coefficient that is equal to two), preferably greater than 180 degrees, more preferably 185 degrees or more, and further preferably 190 degrees or more.

Each of the image-focusing optical systems 201 a and 201 b includes, for example, prisms 202 a and 202 b that change an optical path by 90 degrees, respectively. The seven fisheye lenses of six groups included in each of the image-focusing optical systems 201 a and 201 b can be divided into one group disposed at an entrance side of the prisms 202 a and 202 b, and another group disposed at an exit side of the prisms 202 a and 202 b (facing the image capture elements 200 a and 200 b). For example, in the image-focusing optical system 201 a, light incident on the imaging lens 20 a enters the prism 202 a through each of the lenses belonging to the one group disposed at the incident side of the prism 202 a. Then, the light that has entered the prism 202 a has its optical path changed by 90 degrees, and then irradiates the image capture element 200 a through each lens, an aperture stop, and a filter belonging to the other group disposed at the exit side of the prism 202 a.

The optical elements (lenses, prisms 202 a and 202 b, filters, and aperture stops) of the image-focusing optical systems 201 a and 201 b are disposed at given positions designed with respect to the image capture elements 200 a and 200 b. More specifically, the positions of the optical elements are designed so that the optical axes of the optical elements of the image-focusing optical systems 201 a and 201 b are orthogonal to the center of the light receiving areas of the corresponding image capture elements 200 a and 200 b, and each of the light receiving areas becomes an imaging plane of the corresponding fisheye lens. In the image capture unit 21, the image-focusing optical systems 201 a and 201 b use the same design specifications, and the optical axes of the image-focusing optical systems 201 a and 201 b are matched with each other while the directions of the optical axes are opposite to each other.

FIGS. 3A, 3B and 3C (FIG. 3) are schematic trihedral views (three-sided views) of the appearance of the image capture apparatus 1 a. FIGS. 3A, 3B and 3C respectively correspond to a top view, a front view, and a side view of the image capture apparatus 1 a. As illustrated in FIG. 3C, the image capture unit 21 is configured by the imaging lens 20 a and the imaging lens 20 b, which is disposed at a position opposite to the position of the imaging lens 20 a, that is, on the rear side of the imaging lens 20 a.

In FIG. 3A and FIG. 3C, an angle “a” indicates an angle of view of the imaging lenses 20 a and 20 b, indicating an image capture range. As described above, each of the imaging lenses 20 a and 20 b included in the image capture unit 21 captures images using the angle “a” greater than 180 degrees as the angle of view. Therefore, in order to prevent inclusion of images of the housing 10 a in each image captured by the imaging lenses 20 a and 20 b, as illustrated as surfaces 23-1, 23-2, 23-3, and 23-4 in FIGS. 3A, 3B and 3C, both sides with respect to the center line C of each of the imaging lenses 20 a and 20 b disposed on the first and second surfaces are chamfered in accordance with the angle of view of each of the imaging lenses 20 a and 20 b.

The center line C is a line which passes through the centers of the imaging lenses 20 a and 20 b in a vertical direction when the attitude of the image capture apparatus 1 a is set so that the direction of the vertex (pole) of each hemispherical image captured by the image capture apparatus 1 a is parallel to the vertical direction.

The image capture unit 21 uses a combination of the imaging lenses 20 a and 20 b to set an image capture range of a full view spherical image having a center aligned to the center of the imaging lenses 20 a and 20 b. That is, as described above, the imaging lenses 20 a and 20 b have an angle of view of 180 degrees or more, preferably greater than 180 degrees, and more preferably 185 degrees or more. Therefore, by combining the imaging lenses 20 a and 20 b, for example, an image capture range of a plane perpendicular to the first surface and an image capture range of a plane parallel to the first surface can each be set to 360 degrees, so that an image capture range of the full view spherical image can be implemented using the combination of the imaging lenses 20 a and 20 b.

Configuration of Signal Processing of Image Capture Apparatus of First Embodiment:

FIG. 4 is an example of a hardware block diagram of the image capture apparatus 1 a according to the first embodiment. In FIG. 4, portions corresponding to FIGS. 1 and 2 are denoted by the same reference numerals, and a detailed description thereof will be omitted.

As illustrated in FIG. 4, the image capture apparatus 1 a includes, for example, the image capture unit 21 including the image capture elements 200 a and 200 b, drive units 210 a and 210 b, and signal processing units 211 a and 211 b as a configuration of an imaging system. Further, the image capture apparatus 1 a includes, for example, a central processing unit (CPU) 2000, a read only memory (ROM) 2001, a memory controller 2002, a random access memory (RAM) 2003, a trigger interface (I/F) 2004, a switch (SW) circuit 2005, a data interface (I/F) 2006, a communication interface (I/F) 2007, and an acceleration sensor 2008, which are connected to a bus 2010 as a configuration of an imaging control system and a signal processing system. Further, the image capture apparatus 1 a includes, for example, a battery 2020 for supplying power to each of these components.

At first, a configuration of the imaging system is described. In the configuration of the imaging system, the image capture element 200 a and the drive unit 210 a are disposed for the imaging lens 20 a. Similarly, the image capture element 200 b and the drive unit 210 b are disposed for the imaging lens 20 b. The image capture unit 21 includes the image capture element 200 a and the drive unit 210 a as one set, and the image capture element 200 b and the drive unit 210 b as another set.

In the image capture unit 21, the drive unit 210 a drives the image capture element 200 a and receives electric charges from the image capture element 200 a in accordance with a trigger signal supplied from the trigger I/F 2004. The drive unit 210 a outputs a captured image of one frame based on the electric charges received from the image capture element 200 a in accordance with one trigger signal. The drive unit 210 a converts the electric charges received from the image capture element 200 a into electrical signals and outputs the electrical signals. The electrical signals output from the drive unit 210 a are supplied to the signal processing unit 211 a. The signal processing unit 211 a performs given signal processing, such as noise removal and gain adjustment, on the electrical signals supplied from the drive unit 210 a, converts the analog electrical signals into digital signals, and outputs the digital signals as digital data of the captured image. The captured image is a hemispherical image (fisheye image), in which the hemisphere region corresponding to the angle of view of the imaging lens 20 a is captured as a part of the full view spherical image.

In the image capture unit 21, the image capture element 200 b and the drive unit 210 b have functions similar to those of the image capture element 200 a and the drive unit 210 a, and thereby the description thereof is omitted. Further, since the signal processing unit 211 b has the same function as that of the signal processing unit 211 a described above, the description thereof is omitted.

Each hemispherical image output from each of the signal processing units 211 a and 211 b is stored in the RAM 2003 under the control of the memory controller 2002.

Hereinafter, a description is given of a configuration of the imaging control system and the signal processing system. The CPU 2000 executes one or more programs stored in the ROM 2001 in advance, using a part of a storage area of the RAM 2003 as a working memory, to control the overall operation of the image capture apparatus 1 a. The memory controller 2002 controls data storage and reading to and from the RAM 2003 in accordance with instructions of the CPU 2000.

The SW circuit 2005 detects an operation on the shutter button 30 and transfers a detection result to the CPU 2000. When the CPU 2000 receives the detection result indicating that the operation on the shutter button 30 is detected from the SW circuit 2005, the CPU 2000 outputs a trigger signal. The trigger signal is output via the trigger I/F 2004, and then supplied to each of the drive units 210 a and 210 b.

The data I/F 2006 is an interface that is used for performing data communication with an external device. As to the data I/F 2006, for example, universal serial bus (USB) or Bluetooth (registered trademark) can be applied. The communication I/F 2007 is connected to a network, and controls communication with the network. The network connected to the communication I/F 2007 can be a wired network, a wireless network, or a combination of a wired network and a wireless network.

In the above description, the CPU 2000 outputs the trigger signal in accordance with the detection result of the SW circuit 2005, but not limited thereto. For example, the CPU 2000 can be configured to output the trigger signal in accordance with a signal supplied via the data I/F 2006 or the communication I/F 2007. Further, the trigger I/F 2004 can be configured to generate the trigger signal in accordance with the detection result of the SW circuit 2005 and to supply the trigger signal to each of the drive units 210 a and 210 b.

The acceleration sensor 2008 detects acceleration components in the three axes and transfers a detection result to the CPU 2000. The CPU 2000 detects the vertical direction based on the detection result of the acceleration sensor 2008 and calculates an inclination of the image capture apparatus 1 a with respect to the vertical direction. The CPU 2000 adds inclination information indicating the inclination of the image capture apparatus 1 a to each hemispherical image captured by the imaging lenses 20 a and 20 b in the image capture unit 21 and stored in the RAM 2003.

The battery 2020 is, for example, a secondary battery, such as a lithium ion secondary battery, and is a power supply unit for supplying electric power to each part of the image capture apparatus 1 a that needs to be supplied with power. The battery 2020 includes, for example, a charge/discharge control circuit for controlling charge and discharge to and from the secondary battery.

FIG. 5 is an example of a diagram illustrating an arrangement of the battery 2020 and a circuit unit 2030 in the image capture apparatus 1 a according to the first embodiment. As illustrated in FIG. 5, the battery 2020 and the circuit unit 2030 are disposed inside the housing 10 a. As to the battery 2020 and the circuit unit 2030, at least the battery 2020 is fixed inside the housing 10 a by fixing means, such as an adhesive or a screw. Further, FIG. 5 illustrates an example of a view of the image capture apparatus 1 a viewed from the front side, which corresponds to FIG. 3B. The circuit unit 2030 includes, for example, each element of the above described imaging control system and signal processing system, and can be configured on one or more circuit boards.

The battery 2020 and the circuit unit 2030 can be arranged at given positions as described above, but not limited thereto. For example, if the circuit unit 2030 is sufficiently small, at least the battery 2020 alone may be disposed at a given position.

In such a configuration, the trigger signal is output from the trigger I/F 2004 in response to an operation on the shutter button 30. The trigger signal is supplied to each of the drive units 210 a and 210 b at the same timing. Each of the drive units 210 a and 210 b receives electric charges from the respective image capture elements 200 a and 200 b in synchronization with the supplied trigger signal.

Each of the drive units 210 a and 210 b converts the electric charges received from each of the image capture elements 200 a and 200 b into electrical signals and supplies the electrical signals to each of the signal processing units 211 a and 211 b. Each of the signal processing units 211 a and 211 b performs given processing on each of the electrical signals supplied from each of the drive units 210 a and 210 b, converts each of the electrical signals into image data of a hemispherical image, and outputs the image data of the hemispherical image. Each of the image data of the hemispherical images is stored in the RAM 2003 under the control of the memory controller 2002.

Each of the image data of the hemispherical images stored in the RAM 2003 is transmitted to an external information processing apparatus via the data I/F 2006 or the communication I/F 2007.

Image Processing of First Embodiment:

Hereinafter, a description is given of image processing on the image data of a hemispherical image (hereinafter, simply referred to as a hemispherical image) according to the first embodiment. FIG. 6 is an example of a hardware block diagram of an information processing apparatus 100 a, which can be used as an input apparatus for inputting an annotation according to the first embodiment. As illustrated in FIG. 6, the information processing apparatus 100 a includes, for example, a CPU 1000, a ROM 1001, a RAM 1002, a graphic I/F 1003, a storage 1004, a data I/F 1005, a communication I/F 1006, and an input device 1011, which are connected with each other via a bus 1030, and further a display device 1010 connected to the graphic I/F 1003.

The storage 1004 is a non-volatile memory, such as a flash memory, and stores one or more programs and various data used for operating the CPU 1000. The storage 1004 can use a hard disk drive. The CPU 1000 executes one or more programs stored in the ROM 1001 and the storage 1004 using the RAM 1002 as a working memory to control the overall operation of the information processing apparatus 100 a.

Based on a display control signal generated by the CPU 1000, the graphic I/F 1003 generates a display signal that can be displayed by the display device 1010 and transmits the generated display signal to the display device 1010. The display device 1010 includes, for example, a liquid crystal display (LCD) and a drive circuit for driving the LCD, and displays a screen image corresponding to the display signal transmitted from the graphic I/F 1003.

The input device 1011 outputs a signal corresponding to a user operation and receives a user input. In this example case, the information processing apparatus 100 a is a tablet type personal computer, and the display device 1010 and the input device 1011 are integrally configured as a touch panel 1020. The input device 1011 is disposed on the display device 1010 and outputs position information corresponding to a contact position on the touch panel 1020. The information processing apparatus 100 a is not limited to the tablet type personal computer, and can be, for example, a desktop personal computer.

The data I/F 1005 is an interface for performing data communication with an external device. As to the data I/F 1005, for example, USB or Bluetooth (registered trademark) can be employed. The communication I/F 1006 is connected to a network using wireless communication, and controls communication to the network. The communication I/F 1006 can also be connected to the network via wired communication. In this example case, it is assumed that the image capture apparatus 1 a and the information processing apparatus 100 a are connected via the data I/F 1005 using wired communication, but not limited thereto.

FIG. 7 is an example of a functional block diagram of the information processing apparatus 100 a according to the first embodiment. As illustrated in FIG. 7, the information processing apparatus 100 a includes, for example, an image acquisition unit 110, an image processing unit 111, an additional information generation unit 112, a user interface (UI) unit 113, a communication unit 114, and an output unit 115.

The image acquisition unit 110, the image processing unit 111, the additional information generation unit 112, the UI unit 113, the communication unit 114, and the output unit 115 are implemented by operating an input program according to the first embodiment on the CPU 1000, but not limited thereto. For example, a part or all of the image acquisition unit 110, the image processing unit 111, the additional information generation unit 112, the UI unit 113, the communication unit 114, and the output unit 115 can be configured as one or more hardware circuits that operate in cooperation with each other.

The UI unit 113, which is receiving means, receives a user input in response to a user operation on the input device 1011, and executes processing in accordance with the received user input. The UI unit 113 can store information received in accordance with the user input in the RAM 1002 or the storage 1004. The UI unit 113 is also a display unit that generates a screen image to be displayed on the display device 1010. The communication unit 114 controls communication via the data I/F 1005 and the communication I/F 1006.

The image acquisition unit 110 acquires, via the data I/F 1005, each hemispherical image captured by the imaging lenses 20 a and 20 b from the image capture apparatus 1 a, and the inclination information added to each hemispherical image. The image processing unit 111 performs an image conversion process to generate a full view spherical image by stitching each hemispherical image acquired by the image acquisition unit 110. The additional information generation unit 112 generates additional information (e.g., an annotation) at a designated position on the full view spherical image generated by the image processing unit 111 in accordance with the user input received by the UI unit 113. The output unit 115 creates and outputs a report with a given format based on the annotation generated by the additional information generation unit 112.
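As a rough illustration of how an annotation can be tied to a designated position on the full view spherical image, a minimal Python sketch follows. The record fields (image_id, theta, phi, target_type, comment, photos) are hypothetical names introduced only for this example and are not part of the apparatus described above.

```python
from dataclasses import dataclass, field
from typing import List

@dataclass
class Annotation:
    """Hypothetical record associating diagnosis information with a
    position (theta, phi) on a full view spherical image."""
    image_id: str     # identifier of the full view spherical image
    theta: float      # horizontal angle on the spherical image, in degrees
    phi: float        # vertical angle on the spherical image, in degrees
    target_type: str  # e.g. "crack" or "stain"
    comment: str = "" # free-form diagnosis comment
    photos: List[str] = field(default_factory=list)  # optional attached cut images

# Annotations kept in a simple list; a report can later be built by grouping
# the records by image_id and position.
annotations = [
    Annotation("room1", 123.4, 56.7, "crack", "hairline crack near window"),
]
print(annotations[0])
```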

The input program for implementing each function according to the first embodiment in the information processing apparatus 100 a is recorded on a recording medium readable by a computer, such as a compact disk (CD), a flexible disk (FD), or a digital versatile disk (DVD), in a file of an installable format or an executable format. Further, the input program can be provided by storing the input program on a computer connected to a network such as the Internet and downloading the input program via the network. Further, the input program can be provided or distributed via the network such as the Internet.

The input program includes, for example, a module configuration including the image acquisition unit 110, the image processing unit 111, the additional information generation unit 112, the UI unit 113, the communication unit 114, and the output unit 115. As the actual hardware, when the CPU 1000 reads the input program from a storage medium such as the storage 1004 and executes the input program, each of the above described units is loaded on a main storage device such as the RAM 1002, and the image acquisition unit 110, the image processing unit 111, the additional information generation unit 112, the UI unit 113, the communication unit 114, and the output unit 115 are generated on the main storage device.

Image Conversion in First Embodiment:

Hereinafter, a description is given of an image conversion process according to the first embodiment with reference to FIGS. 8 to 12. The image conversion process described below is performed, for example, by the image processing unit 111 in the information processing apparatus 100 a.

FIG. 8 is an example of a diagram for describing how the imaging lenses 20 a and 20 b applied to the first embodiment project three-dimensional incident light into a two-dimensional space. Hereinafter, the imaging lenses 20 a and 20 b are described using the imaging lens 20 a as a representative of the imaging lenses 20 a and 20 b unless otherwise described.

As illustrated in FIG. 8A, the imaging lens 20 a includes, for example, a fisheye lens 24 (image-focusing optical system 201 a) and the image capture element 200 a. An axis perpendicular to a light receiving surface of the image capture element 200 a is defined as an optical axis. In the example case in FIG. 8A, an incident angle φ is expressed as an angle with respect to a vertex of the optical axis, in which the vertex of the optical axis is at the intersection of the optical axis and a plane contacting the edge of the fisheye lens 24.

A fisheye image (hemispherical image) captured by the fisheye lens 24 having an angle of view greater than 180 degrees becomes an image of a scene of a hemisphere viewed from an image capture position. As illustrated in FIG. 8A and FIG. 8B, a hemispherical image 22 is generated with an image height “h” corresponding to the incident angle “φ,” wherein the relationship between the image height “h” and the incident angle “φ” is determined by the projection function f(φ). In FIG. 8B, a region in which the light from the fisheye lens 24 is not irradiated on the image capture surface of the image capture element 200 a is indicated in solid black as an invalid region. The projection function f(φ) varies depending on the properties of the fisheye lens 24. For example, when the image height “h,” the focal length “f,” and the incident angle “φ” (the incident angle defined by the incidence direction and the optical axis) are set, a fisheye lens 24 using a projection system corresponding to the equidistant projection system represented by the following formula (1) can be used in this description.

h=f×ϕ  (1)
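For reference, a minimal Python sketch of the equidistant projection of formula (1) is shown below; the focal length and incident angle values are only illustrative.

```python
import math

def image_height(focal_length_px: float, incident_angle_rad: float) -> float:
    """Equidistant projection of formula (1): h = f * phi.
    The image height grows linearly with the incident angle."""
    return focal_length_px * incident_angle_rad

# Illustrative values: with f = 600 pixels, a ray entering 90 degrees off the
# optical axis lands about 942 pixels from the image center.
print(image_height(600.0, math.radians(90.0)))  # ~942.48
```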

FIG. 9 is a schematic view for describing an inclination of the image capture apparatus 1 a according to the first embodiment. In FIG. 9, the vertical direction corresponds to the z axis in the orthogonal coordinates of the x-y-z three-dimensional directions of the global coordinate system. When the vertical direction is parallel to the center line C of the image capture apparatus 1 a illustrated in FIG. 3B, the camera is not tilted. If the center line C of the image capture apparatus 1 a is not parallel to the vertical direction, the image capture apparatus 1 a is in an inclined state.

The image capture apparatus 1 a associates each hemispherical image captured by the imaging lenses 20 a and 20 b with an output value output from the acceleration sensor 2008 at the time of imaging, and stores each hemispherical image and output value, for example, in the RAM 2003. The information processing apparatus 100 a acquires each hemispherical image and the output value of the acceleration sensor 2008 stored in the RAM 2003 from the image capture apparatus 1 a.

In the information processing apparatus 100 a, the image processing unit 111 calculates an inclination angle from a gravity vector (hereinafter, inclination angle “α”) and an inclination angle in the x-y plane (hereinafter, inclination angle “β”) using the output value of the acceleration sensor 2008 acquired from the image capture apparatus 1 a and the following formulas (2) and (3). In the following formulas (2) and (3), a value “Ax” indicates a value of the x0-axis direction component of the camera coordinate system in the output value of the acceleration sensor 2008, a value “Ay” indicates a value of the y0-axis direction component of the camera coordinate system in the output value of the acceleration sensor 2008, and a value “Az” indicates a value of the z0-axis direction component of the camera coordinate system in the output value of the acceleration sensor 2008. The image processing unit 111 calculates the inclination angle “α” and the inclination angle “β” from the values of the respective axial components of the acceleration sensor 2008 in accordance with the trigger signal. The image processing unit 111 associates the calculated inclination angle “α” and inclination angle “β” with each hemispherical image acquired from the image capture apparatus 1 a, and stores the calculated inclination angles “α” and “β” and each hemispherical image, for example, in the storage 1004.

α=Arctan(Ax/Ay)  (2)

β=Arccos(Az/√(Ax²+Ay²+Az²))  (3)
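A minimal Python sketch of formulas (2) and (3) follows; the accelerometer readings used in the example are only illustrative, and atan2 is used in place of a plain arctangent to keep the quadrant information.

```python
import math

def inclination_angles(ax: float, ay: float, az: float):
    """Formulas (2) and (3): inclination angles computed from the
    camera-coordinate components (Ax, Ay, Az) of the acceleration sensor output."""
    alpha = math.atan2(ax, ay)  # formula (2): Arctan(Ax / Ay), quadrant-aware
    beta = math.acos(az / math.sqrt(ax * ax + ay * ay + az * az))  # formula (3)
    return alpha, beta

# Illustrative sensor output; only the ratios of the components matter.
alpha, beta = inclination_angles(0.6, 0.3, 9.7)
print(math.degrees(alpha), math.degrees(beta))
```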

The image processing unit 111 generates a full view spherical image based on each hemispherical image acquired from the image capture apparatus 1 a, and the inclination angle “α” and the inclination angle “β” associated with each hemispherical image.

FIGS. 10A and 10B (FIG. 10) are an example of a diagram for describing a format of the full view spherical image according to the first embodiment. FIG. 10A is an example of a format when the full view spherical image is represented by a plane, and FIG. 10B is an example of a format when the full view spherical image is represented by a spherical face. When the full view spherical image is represented using the plane format, as illustrated in FIG. 10A, the full view spherical image becomes an image having a pixel value corresponding to angular coordinates (φ, θ) with respect to the horizontal angle of 0 to 360 degrees and the vertical angle of 0 to 180 degrees. The angular coordinates (φ, θ) are associated with each coordinate point on a spherical face illustrated in FIG. 10B, similar to latitude and longitude coordinates on a globe.

The relationship between the plane coordinate values of the image captured by the fisheye lens and the spherical coordinate values of the full view spherical image can be mapped by using the projection function “f (h=f(θ))” described in FIG. 8. Thus, by converting the two partial images (two hemispherical images) captured by the fisheye lenses and combining (synthesizing) the two partial images, the full view spherical image can be created as a plane image as illustrated in FIG. 10A.

In the first embodiment, a conversion table that associates each pixel position on the hemispherical image with each pixel position on the full view spherical image of the plane image illustrated in FIG. 10A is created in advance and stored in the storage 1004 of the information processing apparatus 100 a. Table 1 is an example of the conversion table. FIG. 11 is an example of a diagram for describing the correspondence between each pixel position on the hemispherical image and each pixel position on the full view spherical image associated by the conversion table according to the first embodiment.

TABLE 1

  Coordinate values of converted image | Coordinate values of pre-conversion image
  θ (pixel)      φ (pixel)             | x (pixel)      y (pixel)
  0              0                     | ...            ...
  1              0                     | ...            ...
  ...            ...                   | ...            ...
  3598           1799                  | ...            ...
  3599           1799                  | ...            ...

As illustrated in Table 1, the conversion table has a data set of the coordinate values (θ, φ) [pixel] of a converted image and the coordinate values (x, y) [pixel] of a pre-conversion image for each of the coordinate values of the converted image. The converted image can be generated from the captured hemispherical image (pre-conversion image) using the conversion table illustrated in Table 1. Specifically, as illustrated in FIG. 11, based on the correspondence relationship between the pre-conversion coordinates and the converted coordinates indicated in the conversion table (Table 1), each pixel of the converted image can be generated by referring to the pixel value at the coordinate values (x, y) [pixel] of the pre-conversion image corresponding to the coordinate values (θ, φ) [pixel].
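As a rough illustration of this per-pixel lookup, the following Python sketch applies a conversion table stored as two index arrays; the array sizes and the use of NumPy fancy indexing are assumptions made only for this example.

```python
import numpy as np

def apply_conversion_table(fisheye_img: np.ndarray, table_x: np.ndarray,
                           table_y: np.ndarray) -> np.ndarray:
    """Builds the converted (full view spherical) image by looking up, for every
    output pixel (theta, phi), the pre-conversion pixel (x, y) given by the
    conversion table (Table 1). table_x and table_y have the output image shape."""
    return fisheye_img[table_y, table_x]

# Hypothetical sizes: a 1800 x 3600 output sampled from a 1600 x 1600 fisheye frame.
fisheye = np.zeros((1600, 1600, 3), dtype=np.uint8)
tx = np.random.randint(0, 1600, size=(1800, 3600))
ty = np.random.randint(0, 1600, size=(1800, 3600))
equirect = apply_conversion_table(fisheye, tx, ty)
print(equirect.shape)  # (1800, 3600, 3)
```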

The conversion table, such as Table 1, reflects the distortion correction, assuming that the direction of the center line C of the image capture apparatus 1 a is parallel to the vertical direction. By applying a correction process in accordance with the inclination angles “α” and “β” to the conversion table, the correction (i.e., vertical correction) to set the center line C of the image capture apparatus 1 a parallel to the vertical direction can be performed.

FIGS. 12A and 12B (FIG. 12) are an example of diagrams for describing the vertical correction according to the first embodiment. FIG. 12A illustrates a camera coordinate system, and FIG. 12B illustrates a global coordinate system, respectively. In FIG. 12B, the three-dimensional Cartesian coordinates of the global coordinate system are denoted by (x1, y1, z1) and the spherical coordinates are denoted by (θ1, φ1). In FIG. 12A, the three-dimensional Cartesian coordinates of the camera coordinate system are denoted by (x0, y0, z0) and the spherical coordinates are denoted by (θ0, φ0).

The image processing unit 111 performs the vertical correction processing using the following formulas (4) through (9) to convert the spherical coordinates (θ1, φ1) into the spherical coordinates (θ0, φ0). At first, in order to correct the inclination, it is necessary to perform the rotation conversion using the three-dimensional orthogonal coordinates, and thereby the image processing unit 111 performs the conversion from the spherical coordinates (θ1, φ1) to the three-dimensional orthogonal coordinates (x1, y1, z1) using the following formulas (4) to (6).

x1=sin(ϕ1)cos(θ1)  (4)

y1=sin(ϕ1)sin(θ1)  (5)

z1=cos(ϕ1)  (6)

Then, the image processing unit 111 performs the rotation coordinate transformation indicated by the following formula (7) using the inclination angles (α, β) to convert the global coordinate system (x1, y1, z1) into the camera coordinate system (x0, y0, z0). In other words, the formula (7) provides a definition of the inclination angles (α, β).

$\begin{pmatrix} x0 \\ y0 \\ z0 \end{pmatrix} = \begin{pmatrix} \cos\alpha & \sin\alpha & 0 \\ -\sin\alpha & \cos\alpha & 0 \\ 0 & 0 & 1 \end{pmatrix} \begin{pmatrix} 1 & 0 & 0 \\ 0 & \cos\beta & \sin\beta \\ 0 & -\sin\beta & \cos\beta \end{pmatrix} \begin{pmatrix} x1 \\ y1 \\ z1 \end{pmatrix}$  (7)

This means that if the global coordinate system is first rotated around the z-axis by the angle “α,” and then rotated around the x-axis by the angle “β,” the global coordinate system becomes the camera coordinate system. Finally, the image processing unit 111 converts the three-dimensional orthogonal coordinates (x0, y0, z0) of the camera coordinate system into the spherical coordinates (θ0, φ0) using the following formulas (8) and (9).

φ0=Arccos(z0)  (8)

θ0=Arctan(y0/x0)  (9)
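The whole chain of formulas (4) through (9) can be summarized by the following Python sketch; it is a minimal illustration assuming all angles are in radians, and the per-pixel application over the conversion table is left out.

```python
import numpy as np

def vertical_correction(theta1: float, phi1: float, alpha: float, beta: float):
    """Converts spherical coordinates (theta1, phi1) of the global coordinate
    system into (theta0, phi0) of the camera coordinate system, formulas (4)-(9)."""
    # Formulas (4)-(6): spherical -> three-dimensional orthogonal coordinates.
    x1 = np.sin(phi1) * np.cos(theta1)
    y1 = np.sin(phi1) * np.sin(theta1)
    z1 = np.cos(phi1)

    # Formula (7): rotate by alpha around the z axis, then by beta around the x axis.
    rot_z = np.array([[ np.cos(alpha), np.sin(alpha), 0],
                      [-np.sin(alpha), np.cos(alpha), 0],
                      [             0,             0, 1]])
    rot_x = np.array([[1,             0,            0],
                      [0,  np.cos(beta), np.sin(beta)],
                      [0, -np.sin(beta), np.cos(beta)]])
    x0, y0, z0 = rot_z @ rot_x @ np.array([x1, y1, z1])

    # Formulas (8) and (9): back to spherical coordinates.
    phi0 = np.arccos(z0)
    theta0 = np.arctan2(y0, x0)  # Arctan(y0 / x0), quadrant-aware
    return theta0, phi0

print(vertical_correction(np.radians(30), np.radians(60), np.radians(5), np.radians(10)))
```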

As described above, the coordinate conversion is performed by executing the vertical correction processing, but not limited thereto. For example, a plurality of conversion tables in accordance with the inclination angles (α, β) can be prepared and stored in advance. With this configuration, the vertical correction processing can be omitted, with which the total processing can be performed faster.

Input Processing of First Embodiment:

Hereinafter, a description is given of input processing of an annotation according to the first embodiment. FIGS. 13A and 13B (FIG. 13) schematically illustrate examples of configurations of an information processing system according to the first embodiment. As illustrated in FIG. 13A, the information processing system includes, for example, the image capture apparatus 1 a, the information processing apparatus 100 a connected by wired or wireless communication with the image capture apparatus 1 a, and a server 6 connected to the information processing apparatus 100 a via a network 5.

Each hemispherical image captured by each of the imaging lenses 20 a and 20 b in the image capture apparatus 1 a is transmitted to the information processing apparatus 100 a. Then, the information processing apparatus 100 a converts the hemispherical images transmitted from the image capture apparatus 1 a into the full view spherical image as described above. Further, the information processing apparatus 100 a adds an annotation to the full view spherical image, converted from the hemispherical images, in accordance with a user input. Then, the information processing apparatus 100 a transmits, for example, the full view spherical image and the annotation to the server 6 via the network 5. Then, the server 6 stores and manages the full view spherical image and the annotation transmitted from the information processing apparatus 100 a.

Further, the information processing apparatus 100 a acquires the full view spherical image and the annotation from the server 6 and creates report data based on the acquired full view spherical image and annotation, but not limited thereto. For example, the information processing apparatus 100 a can also generate the report data based on the full view spherical image and the annotation stored in the information processing apparatus 100 a. Then, the information processing apparatus 100 a transmits the generated report data to the server 6.

In the example case in FIG. 13A, the server 6 is configured as a single computer, but not limited thereto. That is, the server 6 can be configured using a plurality of computers by distributing the functions of the server 6 to the plurality of computers. Further, the server 6 can be configured as a cloud system on the network 5.

Further, as illustrated in FIG. 13B, another configuration can also be employed, in which the image capture apparatus 1 a and the information processing apparatus 100 a are connected to the network 5. In this configuration, each of the hemispherical images captured by the image capture apparatus 1 a is transmitted to the server 6 via the network 5. Then, the server 6 performs the conversion processing on each of the hemispherical images as described above to convert the respective hemispherical images into the full view spherical image, and stores the full view spherical image. Then, the information processing apparatus 100 a adds the annotation to the full view spherical image stored in the server 6.

Further, the first embodiment is not limited to the above configurations. For example, the information processing apparatus 100 a may not be connected to the server 6, in which case the information processing apparatus 100 a performs the processing locally.

FIG. 14 is an example of a diagram for describing an image capture operation using the image capture apparatus 1 a according to the first embodiment. For example, the full view spherical image can be captured by using the image capture apparatus 1 a within a structure 4 to be diagnosed (i.e., a diagnosis target). The image can be captured at a plurality of positions in the structure 4. Since the full view spherical image can be captured by a single image capture operation, missing a part of the interior of the structure 4 (including the ceiling) can be reduced or prevented. Further, in FIG. 14, the image capture apparatus 1 a is used to capture images of the interior of the structure 4, but not limited thereto. For example, images of the exterior surface of the structure 4 can also be captured using the image capture apparatus 1 a. In this case, for example, by performing a plurality of image capture operations while moving around the entire circumference of the structure 4, the images of the exterior surface (e.g., wall surface) of the structure 4 can be captured without missing a part of the exterior surface of the structure 4. As described above, the image capture apparatus 1 a can be used to capture images in a space where the image capture apparatus 1 a is set, such as the image capture apparatus 1 a set inside the structure 4 (e.g., a room). The space for setting the image capture apparatus 1 a is not limited to an enclosed space, such as a room, but the image capture apparatus 1 a can also be set in an open space, such as a park or a street, to capture images of objects existing in the space.

Hereinafter, a description is given of annotation input processing in the information processing system according to the first embodiment with reference to FIGS. 15A to 15D. In the information processing system according to the first embodiment, auxiliary functions are provided to assist the annotation input to the full view spherical image. Hereinafter, a function of automatically estimating an image capture position, a three-dimensional (3D) panoramic automatic tour function for implementing a movement of viewpoints between a plurality of full view spherical images, and an input and confirmation function of an annotation are schematically described as examples of the auxiliary functions.

FIG. 15A is an example of a diagram for describing an automatic estimation function of an image capture position according to the first embodiment. In FIG. 15A, a screen 500 displayed on the display device 1010 displays, for example, a partial image 5010 of a given region in the full view spherical image generated from the hemispherical images captured for generating the full view spherical image of the structure 4 to be diagnosed (i.e., the diagnosis target). Further, a floor plan image 5011 of the structure 4 acquired in advance is displayed on the screen 500.

In an example illustrated in FIG. 15A, a pin marker 5040-1, a pin marker5040-2, and a pin marker 5040-3 are displayed on the floor plan image5011 to indicate image capture positions where the images are captured.

For example, images of a diagnosis target are captured for a pluralityof times using the image capture apparatus 1 a while moving the imagecapture positions, and then a pair of hemispherical images captured bythe imaging lenses 20 a and 20 b are acquired. In the informationprocessing apparatus 100 a, the image processing unit 111 generates aplurality of full view spherical images based on a plurality ofhemispherical images and performs a matching process on the generatedfull view spherical images using a known method. The image processingunit 111 estimates the relative image capture position of each of thefull view spherical images based on a result of the matching process andthe respective hemispherical images before converting to the full viewspherical images. Each image capture position in the structure 4 can beestimated by rotating and scaling each set of image capture positionswhile maintaining the relative positional relationship of the estimatedimage capture positions and by associating each image capture positionwith the floor plan image 5011.
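The rotating and scaling described above can be regarded as fitting a two-dimensional similarity transform (rotation, uniform scale, and translation). The following is a minimal sketch of that alignment step in Python, assuming the relative image capture positions have already been estimated by the matching process and that corresponding reference points on the floor plan image 5011 are available; the function and variable names are illustrative assumptions and not part of the embodiment.

import numpy as np

def align_to_floor_plan(rel_pos, plan_pos):
    """Fit a 2D similarity transform (scale, rotation, translation) that maps
    the estimated relative capture positions onto floor-plan coordinates.

    rel_pos  : (N, 2) array of estimated relative image capture positions
    plan_pos : (N, 2) array of the corresponding points on the floor plan image
    Returns rel_pos mapped into floor-plan coordinates.
    """
    rel_mean, plan_mean = rel_pos.mean(axis=0), plan_pos.mean(axis=0)
    a, b = rel_pos - rel_mean, plan_pos - plan_mean

    # Rotation from the SVD of the cross-covariance matrix (Procrustes).
    u, s, vt = np.linalg.svd(a.T @ b)
    d = np.sign(np.linalg.det(vt.T @ u.T))
    r = vt.T @ np.diag([1.0, d]) @ u.T

    # Uniform scale that preserves the relative positional relationship.
    scale = (s * [1.0, d]).sum() / (a ** 2).sum()

    return (scale * (rel_pos - rel_mean) @ r.T) + plan_mean

# Example: three capture positions matched to three pin positions on the plan.
estimated = np.array([[0.0, 0.0], [1.0, 0.0], [1.0, 1.0]])
pins      = np.array([[120.0, 80.0], [220.0, 80.0], [220.0, 180.0]])
print(align_to_floor_plan(estimated, pins))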

Further, in the information processing apparatus 100 a, the UI unit 113 causes the screen 500 to display a cursor 5041 whose position can be moved arbitrarily in response to a user operation. The UI unit 113 causes the screen 500 to display the partial image 5010 based on the full view spherical image corresponding to the pin marker designated by the cursor 5041 among the pin markers 5040-1, 5040-2, and 5040-3.

In this way, by automatically estimating the image capture positions of the plurality of full view spherical images, a user can easily recognize a relationship between the plurality of full view spherical images, thereby improving the work efficiency.

Further, the UI unit 113 can display a tag at a position designated by the user on the partial image 5010. In an example case in FIG. 15A, tags 5030 a and 5030 b are displayed on the partial image 5010. Although details will be described later, the tags 5030 a and 5030 b indicate locations of the diagnosis target to which the annotations are input, and are associated with coordinates of the full view spherical image including the partial image 5010.

FIGS. 15B and 15C are an example of diagrams for describing the 3D panoramic automatic tour function according to the first embodiment. For example, as illustrated in FIG. 15B, it is assumed that a pin marker 5040-10 is designated on the floor plan image 5011, and a partial image 5010 a (see FIG. 15C) corresponding to the position of the pin marker 5040-10 is displayed on the screen 500. In this state, when a movement (indicated by an arrow in FIG. 15B) to the left oblique front is instructed by the cursor 5041, the UI unit 113 shifts the designation of pin marker from the pin marker 5040-10 to a pin marker 5040-11 located immediately to the left oblique front of the pin marker 5040-10, and causes the screen 500 to display a partial image 5010 b (see FIG. 15C) corresponding to the pin marker 5040-11 based on the full view spherical image. Similarly, when the designation of pin marker shifts from the pin marker 5040-10 to a pin marker 5040-12 (indicated by an arrow in FIG. 15B) displayed immediately to the right oblique front of the pin marker 5040-10, the UI unit 113 causes the screen 500 to display a partial image 5010 c (see FIG. 15C) corresponding to the pin marker 5040-12 based on the full view spherical image.

In this manner, by designating the movement on the screen 500, for example, using the pin markers 5040-1, 5040-2, and 5040-3, the designated pin marker shifts to the adjacent pin marker in the movement direction, and the partial image 5010 displayed on the screen 500 can be switched from one to another in accordance with the shifting of the pin marker as indicated in FIG. 15C. With this configuration, a user can operate as if he or she were observing the diagnosis target sequentially, thereby making it possible to improve the efficiency of the diagnosis operation.

FIG. 15D is an example of a diagram for describing an input and confirmation function of the annotation according to the first embodiment. A desired position on the partial image 5010 is designated using the cursor 5041 on the screen 500, and the UI unit 113 superimposes and displays a tag 5030 a indicating the desired position on the partial image 5010. Further, an annotation input screen 600 for inputting annotation information (hereinafter, annotation) corresponding to the position of the tag 5030 a is displayed on the screen 500 with the partial image 5010. The annotation input screen 600 is used as a diagnosis information input field for inputting diagnosis information of the diagnosis target. In FIG. 15D, the annotation input screen 600 is enlarged for the sake of description.

Although details are to be described later, the annotation input screen 600 can be used to input information for pre-set item(s), any comment, and so forth. Further, the annotation input screen 600 can be used to edit information that has already been input. Further, the annotation input screen 600 can be used to input an image of a designated range including the position indicated by the tag 5030 a on the partial image 5010, and an image acquired from an external device or the like.

In this way, since the position is designated on the full view spherical image and the annotation is input to the designated position, managing the correspondence relationship between the annotation and the image is facilitated, and thereby the efficiency of work, such as creating reports, can be improved.

Input Processing of First Embodiment:

Hereinafter, a description is given of the annotation input processing according to the first embodiment. FIGS. 16A and 16B (FIG. 16) are an example of a flowchart illustrating the steps of the annotation input processing according to the first embodiment.

Prior to performing the processing of the flowchart illustrated in FIG. 16, the information processing apparatus 100 a acquires the respective hemispherical images captured through the imaging lenses 20 a and 20 b and an output value of the acceleration sensor 2008 from the image capture apparatus 1 a, generates a full view spherical image based on the respective hemispherical images, and stores the full view spherical image, for example, in the storage 1004. Hereinafter, in order to avoid complexity, a description is given by assuming that one full view spherical image is generated.

In step S100, in the information processing apparatus 100 a, the image acquisition unit 110 acquires the full view spherical image stored in the storage 1004.

In step S101, the UI unit 113 cuts out an image in a given region from the full view spherical image acquired by the image acquisition unit 110 and displays the cut-out image on the screen 500.

FIG. 17 is an example of a diagram for describing the image cutting process by the UI unit 113 according to the first embodiment. FIG. 17 corresponds to FIG. 10A. The UI unit 113 cuts out an image of a partial region 511 in a full view spherical image 510 generated from the two hemispherical images and displays the image of the partial region 511 on the screen 500 as the partial image 5010.

In step S101, the image of the partial region 511, set as an initial value, is cut out. In step S101, for example, the UI unit 113 replaces the angular coordinates (φ, θ) of the full view spherical image 510 with coordinates (x, y) suitable for displaying on the screen 500, and displays the partial image 5010 on the screen 500.
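As a concrete illustration of the coordinate replacement in step S101, the following minimal sketch maps the angular coordinates (φ, θ) of the full view spherical image to screen pixel coordinates and back, assuming an equirectangular layout with φ in [-π, π] and θ in [-π/2, π/2]; the ranges and function names are assumptions for illustration only.

import math

def angular_to_screen(phi, theta, width, height):
    """Map angular coordinates (phi, theta) of the spherical image to screen
    pixel coordinates (x, y), assuming an equirectangular layout with
    phi in [-pi, pi] (horizontal) and theta in [-pi/2, pi/2] (vertical)."""
    x = (phi + math.pi) / (2.0 * math.pi) * (width - 1)
    y = (math.pi / 2.0 - theta) / math.pi * (height - 1)
    return x, y

def screen_to_angular(x, y, width, height):
    """Inverse mapping, used when a position designated on the screen must be
    stored as coordinates in the full view spherical image."""
    phi = x / (width - 1) * 2.0 * math.pi - math.pi
    theta = math.pi / 2.0 - y / (height - 1) * math.pi
    return phi, theta

# A point at the center of a 1920 x 960 display region maps to (phi, theta) = (0, 0).
print(angular_to_screen(0.0, 0.0, 1920, 960))    # (959.5, 479.5)
print(screen_to_angular(959.5, 479.5, 1920, 960))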

In step S102, the UI unit 113 moves or changes the partial region 511 in the full view spherical image 510 in accordance with a user operation. Further, the UI unit 113 can change a size of the partial region 511 in the full view spherical image 510 in accordance with the user operation.

FIG. 18 is an example of the screen 500 displayed using the display device 1010 under the control of the UI unit 113 according to the first embodiment. In this example case, the UI unit 113 displays the partial image 5010 of the partial region 511 illustrated in FIG. 17 on the entire area of the screen 500. Further, buttons 5020, 5021, 5022, and 5023, and menu buttons 5050 and 5051 are respectively displayed on the screen 500.

The button 5020 is a button for reading the full view spherical image 510. The button 5021 is a button for storing annotation data, which will be described later, which is set on the screen 500. In response to an operation of the button 5021, the currently displayed screen 500 can also be saved. The button 5022 is a button for changing a scale, and in accordance with an operation of the button 5022, an instruction screen for instructing an enlargement and reduction of the partial image 5010 in the screen 500 is displayed. The button 5023 is a button for displaying the annotation input screen 600, which will be described later, with respect to the screen 500.

The menu button 5050 is a button for displaying a menu for switching a display mode of the screen 500. The menu button 5051 is a button for switching a viewpoint of the partial image 5010.

Further, a section 5024 is displayed on the screen 500, in which a relative position of each of the image capture positions estimated by using the function of automatic estimation, described with reference to FIG. 15A, is displayed using pin markers 5040-1 and 5040-2 as illustrated in FIG. 18.

The information processing apparatus 100 a acquires the floor plan image 5011 (see FIG. 19) including a diagnosis target in advance, and stores the floor plan image 5011, for example, in the storage 1004. The UI unit 113 can display the floor plan image 5011 on the screen 500. For example, the UI unit 113 superimposes the floor plan image 5011 on the section 5024 in the screen 500 to display the floor plan image 5011. Further, the UI unit 113 can display the pin markers 5040-1 and 5040-2 indicating the respective image capture positions on the floor plan image 5011 displayed on the screen 500. By performing a given input operation, the user can adjust the position of each of the pin markers 5040-1 and 5040-2 on the floor plan image 5011 to match the actual image capture position while maintaining the relative positional relationship between the pin markers 5040-1 and 5040-2.

In step S103, the UI unit 113 waits for an operation of the button 5023, and if the button 5023 is operated, the UI unit 113 displays the annotation input screen 600 on the screen 500. Then, in accordance with an operation on the annotation input screen 600, a position on the partial image 5010 is designated.

FIG. 20 is an example of the annotation input screen 600 displayed on the screen 500 according to the first embodiment. In FIG. 20, the same reference numerals are assigned to the same parts as those in FIG. 19, and detailed descriptions thereof will be omitted. In FIG. 20, a marker 602 is superimposed on the position designated in step S103. Further, in FIG. 20, the annotation input screen 600 is superimposed and displayed over the partial image 5010. The UI unit 113 associates position information indicating the designated position with identification information identifying the annotation (annotation ID), and stores the position information and the identification information, for example, in the RAM 1002. The position information uses, for example, coordinates in the full view spherical image 510 including the partial image 5010.

The annotation input screen 600 is designed to be displayed on the screen 500 which is displaying the partial image 5010, but the annotation input screen 600 is not required to be superimposed on the partial image 5010. For example, the annotation input screen 600 and the partial image 5010 can be displayed in separate sections in the screen 500, or the annotation input screen 600 can be displayed in a window different from the partial image 5010. Further, a display position of the annotation input screen 600 on the screen 500 can be changed in accordance with the user operation.

In FIG. 20, the annotation input screen 600 includes tabs 601 a, 601 b, and 601 c for switching functions of the annotation input screen 600. The tab 601 a is a tab for editing the annotation. The tab 601 b is a tab for inserting a detailed image; when the tab 601 b is operated, a file selection screen provided, for example, by an operating system (OS) of the information processing apparatus 100 a is displayed, and a desired image can be selected from the file selection screen. The tab 601 c is a tab for selecting a cut region, and in accordance with an operation of the tab 601 c, the display of the screen 500 is switched to a display for designating a region on the partial image 5010.

In the annotation input screen 600 illustrated in FIG. 20, a button 6011 is a button for adding a tag to the partial image 5010. For example, when the button 6011 is operated, and then a position designation operation on the partial image 5010 is performed by the cursor 5041, the UI unit 113 displays the marker 602 at the designated position.

In step S104, the UI unit 113 waits for an operation of any of the tabs 601 a to 601 c of the annotation input screen 600 and the button 5021 in the screen 500, and performs the corresponding processing in response to an operation of any one of the tabs and buttons (e.g., tabs 601 a to 601 c, button 5021).

If the annotation input screen 600 is displayed in response to the operation of the button 5023 as the initial screen or if the tab 601 a is designated, the UI unit 113 determines to perform the annotation input processing in step S110, and proceeds the sequence to step S111.

In step S111, as illustrated in FIG. 20, the UI unit 113 displays the annotation input screen 600 including an annotation input field 6010 used for inputting the annotation.

In an example case in FIG. 20, the annotation input field 6010 includes, for example, input fields 6010 a to 6010 d. In this example case, the input field 6010 a serves as an input field for selecting a pre-set item by a pull-down menu, and it is possible to select various types included in each item, such as the pre-set item of “facility type.” The input field 6010 b serves as an input field for inputting text information. In the input field 6010 b, the text information composed of a certain number of characters, such as a product type number, is input.

The input field 6010 c serves as an input field having a pre-set item list and a check box for inputting a check mark for an appropriate item. The input field 6010 c can designate a plurality of items. Further, the input field 6010 c displays, for example, one or more items corresponding to the contents selected in the input field 6010 a. In this example case, the items corresponding to “exit sign” selected in the “facility type” of the input field 6010 a are displayed in the input field 6010 c.

The input field 6010 d serves as, for example, an input field for inputting a note, in which text information can be input in a free form. The number of characters may be unlimited, or a sentence having a certain length can be input. The UI unit 113 stores the contents input in the annotation input field 6010 in the RAM 1002 in association with the position information indicating the position designated in step S103.
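A minimal sketch of how the contents of the annotation input field 6010 could be held in memory in association with the designated position is shown below; the in-memory dictionary stands in for the RAM 1002, and the field names, which mirror the input fields 6010 a to 6010 d, are otherwise assumptions for illustration only.

import uuid

# Illustrative in-memory store keyed by annotation ID, standing in for the RAM 1002.
annotation_store = {}

def store_annotation_input(position, facility_type, product_number, checked_items, note):
    """Associate the contents of the annotation input field 6010 with the
    position information (phi, theta) designated in step S103."""
    annotation_id = str(uuid.uuid4())
    annotation_store[annotation_id] = {
        "position": position,              # (phi, theta) in the spherical image
        "facility_type": facility_type,    # input field 6010a (pull-down menu)
        "product_number": product_number,  # input field 6010b (text)
        "checked_items": checked_items,    # input field 6010c (check boxes)
        "note": note,                      # input field 6010d (free-form text)
    }
    return annotation_id

aid = store_annotation_input((0.42, -0.10), "exit sign", "XYZ-1234",
                             {"lamp lit": True, "cover intact": False},
                             "cover cracked at lower-left corner")
print(annotation_store[aid]["facility_type"])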

Further, even when the annotation input screen 600 is being displayed in step S111, the buttons 5020 to 5023, the tabs 601 a to 601 c, and the menu buttons 5050 and 5051 arranged on the screen 500 can be operated respectively. Further, an operation of changing the position of the partial region 511, from which the partial image 5010 is cut out, within the full view spherical image 510 can also be performed.

Further, an operation of designating the next position on the partial image 5010 can also be performed while the annotation input screen 600 is being displayed in step S111.

In step S150, the UI unit 113 determines whether the next position is designated on the partial image 5010. If the UI unit 113 determines that the next position is designated (step S150: YES), the UI unit 113 returns the sequence to step S102.

On the other hand, if the UI unit 113 determines that the next position is not designated (step S150: NO), the UI unit 113 returns the sequence to step S104. In this case, the state right before the determination in step S150 is maintained.

Further, if the tab 601 b is designated in step S104, the UI unit 113 determines to perform the detailed image insertion in step S120 and proceeds the sequence to step S121.

In step S121, the UI unit 113 displays an image file selection screen on the screen 500. The UI unit 113 can use, for example, a file selection screen that is provided as a standard function by an operating system (OS) installed on the information processing apparatus 100 a as the image file selection screen. In this case, it is preferable that the file names are filtered based on, for example, one or more file extensions, and that only the file names of the corresponding image files are displayed.

In step S122, the UI unit 113 reads the image file selected in the image file selection screen.

In step S123, the UI unit 113 displays the image file read in step S122 in a given area in the annotation input screen 600. The UI unit 113 stores the file name including a path to the read image file, for example, in the RAM 1002 in association with the position information indicating the position designated in step S103. After the processing in step S123, the sequence proceeds to step S150, which is described above.

Further, if the tab 601 c is designated in step S104, the UI unit 113 determines to perform a selection of a cut region for the partial image 5010 in step S130, and proceeds the sequence to step S131.

In step S131, the UI unit 113 switches the display of the annotation input screen 600 to a display of the region designation screen used for designating the cut region as a diagnosis region including a diagnosis target image. FIG. 21 is an example of the annotation input screen 600 switched to the region designation screen according to the first embodiment.

In FIG. 21, the annotation input screen 600 switched to the region designation screen includes, for example, a cut image display field 6020, a check box 6021, and a selection button 6022. The cut image display field 6020 displays an image cut from a region designated for the partial image 5010. In the initial state where no region is designated for the partial image 5010, the cut image display field 6020 displays, for example, a blank, but not limited thereto. For example, an image of a region set in advance for the position designated in step S103 can be cut from the partial image 5010 and displayed in the cut image display field 6020.

The check box 6021 is a button for displaying a marker image as an icon in the cut image display field 6020. The selection button 6022 is a button for starting the designation of the cut region for the partial image 5010.

FIG. 22 is an example of the screen 500 being displayed when designating a cut region in accordance with an operation on the selection button 6022 according to the first embodiment. As illustrated in FIG. 22, in response to the operation on the selection button 6022, the UI unit 113 displays a frame 603 indicating the cut region on the screen 500, and deletes the display of the annotation input screen 600 from the screen 500. For example, the frame 603 is displayed at a region that is pre-set for the position designated in step S103.

The UI unit 113 can change the size, shape, and position of the frame 603 in accordance with a user operation. The shape may be limited to, for example, a rectangular shape, but the ratio of the lengths of the long side and the short side can be changed (i.e., the aspect ratio of the rectangular shape can be changed). When changing the size, shape, or position of the frame 603, the UI unit 113 can display a warning screen indicating that the changed frame 603 may not include the position designated in step S103.

Returning to the flowchart of FIG. 16, the UI unit 113 determines whether the region designation has been completed in step S132. For example, if the display device 1010 and the input device 1011 are constituted as the touch panel 1020 and the region designation by the frame 603 is performed by a drag operation on the touch panel 1020, such as moving a hand or finger of the user while the hand or finger is touching the touch panel 1020, the UI unit 113 can determine that the drag operation has ended and that the region designation by the frame 603 is completed when the hand or finger of the user is detached from the touch panel 1020.

If the UI unit 113 determines that the region designation has not been completed (step S132: NO), the UI unit 113 repeats step S132. On the other hand, if the UI unit 113 determines that the region designation has been completed (step S132: YES), the UI unit 113 proceeds the sequence to step S133.

In step S133, the UI unit 113 acquires an image of the region designated by the frame 603 from the partial image 5010 as a cut image. In step S133, the UI unit 113 acquires, for example, coordinates of each vertex of the frame 603 in the full view spherical image 510 including the partial image 5010, and stores the acquired coordinates in, for example, the RAM 1002 in association with the position information indicating the position designated in step S103.
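A minimal sketch of recording the vertex coordinates of the frame 603 in the angular coordinate system of the full view spherical image 510 is shown below. It assumes, for simplicity, that the partial image 5010 is displayed in the same equirectangular layout as in the earlier sketch, which may differ from the actual cut-out projection; the names and values are illustrative only.

import math

def frame_vertices_to_angular(left, top, right, bottom, width, height):
    """Convert the four screen-space corners of the cut-region frame 603 into
    (phi, theta) coordinates of the full view spherical image, in the order
    upper left, lower left, upper right, lower right used by the image region data."""
    def to_angular(x, y):
        phi = x / (width - 1) * 2.0 * math.pi - math.pi
        theta = math.pi / 2.0 - y / (height - 1) * math.pi
        return phi, theta
    return [to_angular(x, y) for (x, y) in
            [(left, top), (left, bottom), (right, top), (right, bottom)]]

# A frame drawn on a 1920 x 960 display region.
print(frame_vertices_to_angular(800, 300, 1100, 600, 1920, 960))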

In step S134, the UI unit 113 displays the cut image acquired in step S133 in the cut image display field 6020 in the annotation input screen 600. Further, the UI unit 113 stores the cut image acquired in step S133 in a file having a given file name to create a cut image file. The UI unit 113 stores the cut image file in, for example, the RAM 1002.

In step S135, the UI unit 113 determines whether an icon is to be displayed for the cut image displayed in the cut image display field 6020. If the check box 6021 of the annotation input screen 600 is checked, the UI unit 113 determines that the icon is to be displayed (step S135: YES) and proceeds the sequence to step S136.

In step S136, as illustrated in FIG. 23, the UI unit 113 displays a marker 602 x for the cut image displayed in the cut image display field 6020 at a position corresponding to the position designated in step S103. In this example case, the marker 602 x has the same shape as the marker 602 displayed on the partial image 5010, but the marker 602 x and the marker 602 can be set to have different shapes.

After the processing in step S136, the sequence proceeds to step S150, which is described above. Further, if the UI unit 113 determines that the icon is not to be displayed in step S135 (step S135: NO), the sequence proceeds to step S150.

Further, if the button 5021 is operated in step S104, the UI unit 113 determines to save the contents input on the annotation input screen 600 in step S140, and proceeds the sequence to step S141. In step S140, the UI unit 113 instructs the additional information generation unit 112 to store each item of data input in the annotation input screen 600.

In step S141, the additional information generation unit 112 generates annotation data based on the data stored in the RAM 1002 by the UI unit 113 through the above described processing (e.g., steps S103, S111, S123, S133, and S134) in accordance with the instruction of the UI unit 113. The annotation data is generated for each of the positions designated in step S103. Then, the additional information generation unit 112 stores the generated annotation data in, for example, the storage 1004 of the information processing apparatus 100 a, but not limited thereto. For example, the additional information generation unit 112 can transmit the generated annotation data to the server 6 via the network 5 to store the generated annotation data in the server 6.

Tables 2 to 7 are examples of the configuration of the annotation data according to the first embodiment.

TABLE 2: Annotation data
shape data
attribute data
image region data
annotation data ID

TABLE 3: Shape data
primitive type
number of vertices
list of vertices coordinates

TABLE 4: Attribute data
creator name
creation date
location
target
source image
survey date
response method
attribute information list
tag

TABLE 5: Image region data
coordinates of upper left vertex
coordinates of lower left vertex
coordinates of upper right vertex
coordinates of lower right vertex

TABLE 6: Primitive type
point
line
polygon

TABLE 7: Attribute information
type
name
attribute value list

Table 2 is an example of a configuration of the annotation data. The annotation data includes, for example, shape data, attribute data, image region data, and annotation data ID. Table 3, Table 4, and Table 5 respectively indicate an example of the configuration of the shape data, the attribute data, and the image region data.

Table 3 is an example of the shape data. The shape data indicates a shape formed by the position designated by a user in step S103 in FIG. 16, and includes, for example, items of “primitive type,” “number of vertices,” and “list of vertices coordinates.”

Table 6 is an example of values defined in the item of “primitive type.” In this example case, the item of “primitive type” is defined by values of “point,” “line,” and “polygon.” If the value of the item of “primitive type” is “point,” the value of “number of vertices” is set to “1.” If the value of the item of “primitive type” is “line,” the value of “number of vertices” is set to “two or more.” Further, if the value of the item of “primitive type” is “polygon,” the value of “number of vertices” is set to “three or more.”

The item of “list of vertices coordinates” defines each vertex indicated by the item of “vertex number” using the (φ, θ) coordinate system. In this configuration, a closed region, enclosed by straight lines connecting the coordinates listed in the item of “list of vertices coordinates” in the order starting from the beginning of the listed vertices coordinates, becomes a region of the “polygon” defined by the primitive type.

In the above described example, since one point is designated in step S103 of FIG. 16, the item of “primitive type” is set to the value of “point” and the item of “number of vertices” is set to “1.” Further, the item of “list of vertices coordinates” describes the coordinates of the designated one point alone.

Table 4 is an example of the attribute data. The attribute data includes items, such as “creator name,” “creation date,” “location,” “target,” “source image,” “survey date,” “response method,” “attribute information list,” and “tag.”

The value of the item of “creator name” is acquired from, for example, login information of the information processing apparatus 100 a or login information for an input program for implementing the functions of the information processing apparatus 100 a according to the first embodiment. The value of the item of “creation date” is acquired from system time information based on a clock possessed by the information processing apparatus 100 a.

The value of the item of “location” is acquired from an image capture position on the floor plan image 5011 if the floor plan image 5011 can be acquired. If the floor plan image 5011 cannot be acquired, the value of the item of “location” can be acquired from the user input.

The value of the item of “target” is acquired from the user input. The value of the item of “source image” uses an image name (file name) that the annotation input screen 600 refers to when the annotation is created. For example, the value of the item of “source image” uses the file name of the cut image file displayed in the cut image display field 6020 described above.

The value of the item of “survey date” is acquired from a timestamp of the image file described in the item of “source image.” The value of the item of “response method” is acquired from the user input.

The value of the item of “attribute information list” describes a list of the attribute information illustrated in Table 7. In Table 7, the attribute information includes items, such as “type,” “name,” and “attribute value list.”

The value of the item of “type” is a type of the diagnosis target to be diagnosed, such as facilities and findings, and, for example, the value of the input field 6010 a for inputting the facility type in the annotation input screen 600, illustrated in FIG. 20, is used.

The value of the item of “name” is a specific name of the diagnosis target to be diagnosed, and, for example, the value of the input field 6010 b used for inputting the product type number in the annotation input screen 600, illustrated in FIG. 20, is used.

As to the value of the item of “attribute value list,” for example, the name corresponding to each check box in the input field 6010 c in the annotation input screen 600, illustrated in FIG. 20, is set as an attribute name, and a value of the check box is set as an attribute value, and then a list of paired attribute names and attribute values is acquired. Each attribute name included in the item of “attribute value list” varies depending on the value of the item of “type.”

Returning to the description of Table 4, the value of the item of “tag” is acquired from the user input. For example, as the value of the item of “tag,” a value (e.g., remarks) input into the input field 6010 d in the annotation input screen 600 can be used.

Table 5 illustrates an example of the image region data. The image region data indicates, for example, coordinates of the upper left, lower left, upper right, and lower right vertices of the cut image designated in step S132 of FIG. 16, defined by the angular coordinates (φ, θ) of the full view spherical image 510.
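To make the relationships among Tables 2 to 7 concrete, the following is a minimal sketch of the annotation data expressed as plain Python data classes; the field names follow the tables, while the types, defaults, and example values are assumptions for illustration only.

from dataclasses import dataclass, field
from typing import List, Tuple

@dataclass
class ShapeData:                       # Table 3 / Table 6
    primitive_type: str                # "point", "line", or "polygon"
    number_of_vertices: int
    vertices: List[Tuple[float, float]]  # list of (phi, theta) coordinates

@dataclass
class AttributeInfo:                   # Table 7
    type: str
    name: str
    attribute_values: List[Tuple[str, str]]  # (attribute name, attribute value) pairs

@dataclass
class AttributeData:                   # Table 4
    creator_name: str
    creation_date: str
    location: str
    target: str
    source_image: str
    survey_date: str
    response_method: str
    attribute_information_list: List[AttributeInfo] = field(default_factory=list)
    tag: str = ""

@dataclass
class ImageRegionData:                 # Table 5
    upper_left: Tuple[float, float]
    lower_left: Tuple[float, float]
    upper_right: Tuple[float, float]
    lower_right: Tuple[float, float]

@dataclass
class AnnotationData:                  # Table 2
    shape: ShapeData
    attributes: AttributeData
    image_region: ImageRegionData
    annotation_data_id: str

# A single designated point, as described for step S103 (illustrative values).
point_annotation = AnnotationData(
    shape=ShapeData("point", 1, [(0.42, -0.10)]),
    attributes=AttributeData("inspector A", "2018-03-29", "meeting room",
                             "exit sign", "cut_0001.png", "2018-03-28",
                             "replace cover",
                             [AttributeInfo("facility", "exit sign",
                                            [("lamp lit", "yes")])],
                             tag="cover cracked"),
    image_region=ImageRegionData((0.30, 0.02), (0.30, -0.22),
                                 (0.54, 0.02), (0.54, -0.22)),
    annotation_data_id="annotation-0001",
)
print(point_annotation.shape.primitive_type)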

Returning to the description of FIG. 16, in response to the completion of storing the above described annotation data in step S141, the additional information generation unit 112 proceeds the sequence to step S142.

In step S141, the additional information generation unit 112 associates the information (e.g., a file name including a path) indicating the full view spherical image 510 already acquired at the time of inputting the annotation data, with the annotation data. As indicated in an example case in FIG. 15A, the additional information generation unit 112 associates the information indicating each of the full view spherical images 510 corresponding to each of the pin markers 5040-1, 5040-2, 5040-3, and so on, with the annotation data.

In step S142, the UI unit 113 determines whether an end of the annotation input processing by the input program is instructed. For example, after the saving process in step S141, the UI unit 113 displays an end instruction screen for instructing the end or continuation of the annotation input processing. If the UI unit 113 determines that the end of the annotation input processing by the input program is instructed (step S142: YES), the UI unit 113 ends the series of processes in accordance with the flowchart of FIG. 16. On the other hand, if the UI unit 113 determines that the continuation of the annotation input processing is instructed (step S142: NO), the UI unit 113 proceeds the sequence to step S150.

Output Processing of First Embodiment:

Hereinafter, a description is given of the output processing according to the first embodiment. In the information processing apparatus 100 a, the output unit 115 creates report data summarizing a diagnosis result of a diagnosis target based on the annotation data created as described above.

For example, in the information processing apparatus 100 a, the UI unit 113 reads, for example, the respective annotation data and the respective full view spherical images 510 associated with the respective annotation data in accordance with the operation of the button 5020 on the screen 500. The UI unit 113 cuts out the partial region 511 of the full view spherical image 510, which is one of the read full view spherical images 510, and displays an image of the partial region 511 as the partial image 5010 on the screen 500. In this case, the UI unit 113 displays, for example, the tag corresponding to each annotation data at a position on the partial image 5010 defined by the value of the item of “list of vertices coordinates” set in the shape data of each annotation data.

FIG. 24 is an example of a display of the screen 500 related to the report data creation according to the first embodiment. In FIG. 24, for example, tags 5030 c and 5030 d are displayed on the screen 500 in accordance with the item of “tag” included in the attribute data of each annotation data. Further, in correspondence with the tags 5030 c and 5030 d, the values (e.g., remarks) input to the item of “tag” are displayed as comments 604 a and 604 b. The output unit 115 creates the report data based on the annotation data corresponding to the designated tag in accordance with the designation of the tags 5030 c and 5030 d, but not limited thereto.

For example, the output unit 115 can perform a detailed search based on a condition designated by a user for each item of the attribute data of each annotation data and create the report data based on the annotation data acquired as a result of the search.

FIG. 25 is an example of the report data according to the first embodiment. In this example case, the report data includes, for example, records including items of “inspection date” and “inspector name,” and items of “room name,” “diagnosis target,” “field photograph,” “contents,” “necessity of action,” and “remarks” for each annotation data.

In the records of the report data, the items of “inspection date” and “inspector name” can be acquired, for example, by referring to Table 4 described above, from the items of “creation date” and “creator name” in the attribute data of the annotation data.

The item of “diagnosis target” in the report data can be acquired from the item of “name” in the item of “attribute information list” in the attribute data of the annotation data, by referring to Table 4 and Table 7.

Further, the items of “room name,” “contents,” and “necessity of action” in the report data can be acquired from the items of “location,” “tag,” and “response method” in the attribute data of the annotation data.

Further, the item of “field photograph” is embedded in the report data, for example, by referring to Table 4 and embedding an image acquired using the image name described in the item of “source image” in the attribute data of the annotation data.

Further, in the report data, the item of “remarks” can be acquired, for example, in accordance with the user input at the time of creating the report data.
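As a minimal sketch, the mapping from the annotation data items to the report record items described above could be expressed as follows; the dictionary keys are assumptions standing in for the stored annotation data and are not part of the embodiment.

def annotation_to_report_record(annotation, remarks=""):
    """Build one report record (FIG. 25) from one annotation data entry,
    following the item mapping described in the text.

    `annotation` is assumed to be a dict shaped like the annotation data tables
    (Tables 2 to 7); the key names are illustrative.
    """
    attributes = annotation["attribute_data"]
    info_list = attributes["attribute_information_list"]
    return {
        "inspection date":     attributes["creation_date"],
        "inspector name":      attributes["creator_name"],
        "room name":           attributes["location"],
        "diagnosis target":    info_list[0]["name"] if info_list else "",
        "field photograph":    attributes["source_image"],
        "contents":            attributes["tag"],
        "necessity of action": attributes["response_method"],
        "remarks":             remarks,   # supplied at report creation time
    }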

Further, the report data of FIG. 25 is just one example, and each item included in the records is not limited thereto. Further, each item included in the records can be changed and set in accordance with the user input.

The output unit 115 outputs the report data created in this manner with a given data format. For example, the output unit 115 can output the report data using any data format of a commercial document creation application program, a table creation application program, an application program for creating presentation materials, and the like. Further, the output unit 115 can output the report data using a given data format specialized for printing and displaying, such as the portable document format (PDF).

Further, the output unit 115 is not limited to outputting the report data illustrated in FIG. 25. For example, the output unit 115 can associate link information for calling the panoramic automatic tour function (see FIGS. 15B and 15C), starting from the image capture position corresponding to the concerned image, with the image embedded in the item of “field photograph.” The UI unit 113 can be configured to display the images representing the report data illustrated in FIG. 25 on the screen 500. When the image displayed as the item of “field photograph” is designated by the user operation, the UI unit 113 switches the display of the screen 500 to, for example, the display illustrated in FIG. 15B in accordance with the link information associated with the concerned image, but not limited thereto.

Further, it is also possible to associate an image embedded in the item of “field photograph” with link information of a specific web site.

Further, in the report data, it is also possible to include an item of “time-series photograph” for embedding time-series photographs in place of the item of “field photograph.” The time-series photographs include, for example, a plurality of images, captured at different times at a specific position in a specific image capture range, arranged along the image capturing time line, and the plurality of captured images is stored, for example, in the information processing apparatus 100 a in association with the position information indicating the concerned position. When displaying the report data on the screen, the output unit 115 displays the time-series images containing the plurality of images for the item of “time-series photograph.” This makes it possible to easily recognize the progress at a work site, such as a construction work site. Further, by comparing the images captured at different times, changes over time can be easily confirmed.

In an example of FIG. 24, the annotation data used for creating the report data is designated by designating the tags 5030 c and 5030 d displayed on the screen 500, but not limited thereto. For example, a list including thumbnail images, comments, and attribute information can be created and displayed for each annotation data. The thumbnail image can be generated by reducing the size of the cut image.

Further, comments input by a person other than the report creator (inspector) can be associated with the report data. In this configuration, a designer can respond to the report data created by the inspector, and the inspector can further respond to the reply of the designer, with which a thread function of comments can be implemented.

Variant Example of First Embodiment:

Hereinafter, a description is given of a variant example according to the first embodiment. In the first embodiment described above, as described with reference to FIGS. 20 to 23, the position is designated using the point in step S103 of FIG. 16, but not limited thereto. For example, the position can be designated using a line or a polygon as described for the item of “primitive type” with reference to Table 3 and Table 6.

Hereinafter, a description is given of designating the position using the line in step S103 of FIG. 16. FIG. 26 is an example case in which a crack 610 is observed on a wall surface, which is applied to a variant example according to the first embodiment. In FIG. 26 and FIGS. 27 to 29 to be described later, portions corresponding to FIGS. 20 to 23 are denoted by the same reference numerals, and a detailed description thereof will be omitted.

In FIG. 26, the crack 610 having a linear shape is observed in the partial image 5010 displayed on the screen 500. In step S103 of FIG. 16, the user designates, for example, an image of the crack 610 on the partial image 5010 displayed on the touch panel 1020 by following or tracing the image of the crack 610 (e.g., by a dragging operation). The position information indicating the crack 610 is, for example, a set of position information of a plurality of sequentially adjacent points, in which the interval between adjacent points is smaller than a threshold value.
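A minimal sketch of building such position information from a dragging operation is shown below; it subdivides the drag path so that adjacent points are spaced no farther apart than a threshold, with the threshold value and names being assumptions for illustration only.

import math

def record_trace(drag_points, max_spacing=5.0):
    """Build the position information for a traced finding (e.g. the crack 610)
    as a sequence of points in which adjacent points are no farther apart than
    `max_spacing` (an assumed threshold, in pixels), subdividing long drag
    segments where necessary."""
    if not drag_points:
        return []
    trace = [drag_points[0]]
    for (x1, y1) in drag_points[1:]:
        x0, y0 = trace[-1]
        dist = math.hypot(x1 - x0, y1 - y0)
        steps = max(1, int(math.ceil(dist / max_spacing)))
        for k in range(1, steps + 1):
            t = k / steps
            trace.append((x0 + t * (x1 - x0), y0 + t * (y1 - y0)))
    return trace

# Sparse drag events become a densely sampled polyline along the crack.
print(len(record_trace([(0, 0), (30, 40), (60, 40)])))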

FIG. 27 is an example of the annotation input screen 600 for inputting an annotation for the designated crack 610 according to a variant example of the first embodiment, and FIG. 27 corresponds to FIG. 20 described above. In FIG. 27, the annotation input screen 600 includes, for example, an annotation input field 6010 x used for inputting the annotation. In this example case, the annotation input field 6010 x includes, for example, input fields 6010 d and 6010 e.

The input field 6010 e is a field for inputting information on a state of a finding, such as the crack 610, and includes input items, such as “portion,” “finding type,” “width,” “length,” and “status.” Among these input items, the items of “finding type” and “status” are input fields for selecting one of pre-set items using a pull-down menu. Further, the items of “portion,” “width,” and “length” are input with names and values by a user. The items of “width” and “length” are not fixed to specific values, but can be set to correspond to the contents selected in the item of “finding type.”

In the annotation input screen 600 illustrated in FIG. 27, a button 6012 is a button for adding a finding type. In response to an operation of the button 6012, the UI unit 113 adds, for example, a set of the above items of “finding type,” “width,” “length,” and “status” to the annotation input screen 600 and displays the values of the added items of “finding type,” “width,” “length,” and “status.” The “finding type” means any kind of finding detectable or observable for the structure 4, such as physical objects observed in the structure 4, potential or imminent abnormalities (e.g., initial defects, aging defects, damages, deformations) that may cause problems, and non-abnormality portions (e.g., stains) that may not cause problems.

The designation of the cut region for the position designated by the line (pattern) can be performed in the same manner as described in FIG. 21 and FIG. 22 in the first embodiment. FIG. 28 is an example of switching the annotation input screen 600 to a region designation screen (i.e., annotation input screen 600 x), which can be applied to the variant example according to the first embodiment. In this example case, a crack 610 x corresponding to the crack 610 on the partial image 5010 is displayed in the cut image display field 6020. Other portions in the annotation input screen 600 x are the same as the annotation input screen 600 described above in FIG. 21.

Further, FIG. 29 is an example of the display of the screen 500 for designating the cut region in accordance with an operation on the selection button 6022 according to the variant example of the first embodiment. In this case too, the frame 603 is designated to include the crack 610. If a part or all of the crack 610 is not included (displayed) in the frame 603, the UI unit 113 can display a warning screen indicating that the part or all of the crack 610 is not included (displayed) in the frame 603.

Similarly, the position and the cut region can be designated using a polygon (including a concave polygon), such as a triangle, a pentagon, and so on.

Second Embodiment

Hereinafter, a description is given of a second embodiment. In the first embodiment described above, the annotation data is generated based on the full view spherical image 510 having two-dimensional coordinate information. In contrast, in the second embodiment, a three-dimensional image having three-dimensional coordinate information can be further used for generating the annotation data.

Image Capture Apparatus according to Second Embodiment:

FIG. 30 is an example of a schematic view of an image capture apparatus 1 b according to the second embodiment. In FIG. 30, the same reference numerals are assigned to the same parts as those in FIG. 1, and detailed descriptions thereof will be omitted. In FIG. 30, the image capture apparatus 1 b includes, for example, a plurality of imaging lenses 20 a-1, 20 a-2, 20 a-3, 20 a-4, and 20 a-5 (five imaging lenses) and a shutter button 30 on a first surface of a housing 10 b having a substantially rectangular parallelepiped shape. An image capture element respectively corresponding to each of the imaging lenses 20 a-1, 20 a-2, . . . , and 20 a-5 is provided in the housing 10 b.

Further, a plurality of imaging lenses 20 b-1, 20 b-2, 20 b-3, 20 b-4, and 20 b-5 (five imaging lenses) are provided on a second surface of the housing 10 b, which is the rear side of the first surface. Similar to the imaging lenses 20 a-1, 20 a-2, . . . , and 20 a-5, an image capture element respectively corresponding to each of the imaging lenses 20 b-1, 20 b-2, . . . , and 20 b-5 is provided in the housing 10 b.

Each of the imaging lenses 20 a-1, 20 a-2, . . . , and 20 a-5 and the corresponding one of the imaging lenses 20 b-1, 20 b-2, . . . , and 20 b-5, which are set at the same height from the bottom surface of the housing 10 b, constitute a pair of imaging lenses (e.g., the imaging lenses 20 a-1 and 20 b-1) to configure image capture units 21-1, 21-2, 21-3, 21-4, and 21-5 as indicated by dotted lines in FIG. 30.

Since the imaging lenses 20 a-1, 20 a-2, . . . , and 20 a-5 and the imaging lenses 20 b-1, 20 b-2, . . . , and 20 b-5 are the same as the imaging lenses 20 a and 20 b described in the first embodiment, a detailed description thereof will be omitted. Further, each of the image capture units 21-1, 21-2, 21-3, 21-4, and 21-5 corresponds to the image capture unit 21 described above.

In the second embodiment, each of the imaging lenses 20 a-1, 20 a-2, . . . , and 20 a-5 is spaced apart equally from the adjacent imaging lens by the distance “d.” Further, the imaging lenses 20 a-1 and 20 b-1, the imaging lenses 20 a-2 and 20 b-2, the imaging lenses 20 a-3 and 20 b-3, the imaging lenses 20 a-4 and 20 b-4, and the imaging lenses 20 a-5 and 20 b-5 are disposed in the housing 10 b, for example, while matching the respective heights from the bottom surface of the housing 10 b.

Further, the imaging lenses 20 a-5 and 20 b-5 of the image capture unit 21-5, disposed at the lowest portion of the image capture units 21-1, 21-2, 21-3, 21-4, and 21-5, are positioned at the height “h” from the bottom surface of the housing 10 b. Further, for example, each of the imaging lenses 20 a-1 to 20 a-5 is disposed in the housing 10 b by setting the imaging lens 20 a-5 at the lowest portion of the imaging lenses 20 a-1 to 20 a-5 and disposing the imaging lenses 20 a-4, 20 a-3, 20 a-2, and 20 a-1 from the bottom side of the housing 10 b toward the upper face of the housing 10 b, each spaced apart equally from the adjacent imaging lens by the distance “d,” while aligning the center of each of the imaging lenses 20 a-1 to 20 a-5 along the center line of the longitudinal direction of the housing 10 b.

The shutter button 30 is a button for instructing an image capture operation using each of the imaging lenses 20 a-1, 20 a-2, . . . , and 20 a-5 and the imaging lenses 20 b-1, 20 b-2, . . . , and 20 b-5, respectively, in accordance with an operation of the shutter button 30. When the shutter button 30 is operated, the image capture operations using the imaging lenses 20 a-1, 20 a-2, . . . , and 20 a-5 and the imaging lenses 20 b-1, 20 b-2, . . . , and 20 b-5 are performed in a synchronized manner.

As illustrated in FIG. 30, the housing 10 b of the image capture apparatus 1 b includes, for example, an image capture portion 2 b, in which each of the image capture units 21-1 to 21-5 is disposed, and an operation portion 3 b, in which the shutter button 30 is disposed. The operation portion 3 b is provided with a grip portion 31 for holding the image capture apparatus 1 b, and a fixing portion 32 for fixing the image capture apparatus 1 b to a tripod or the like on the bottom surface of the grip portion 31.

Although the image capture apparatus 1 b includes the five image capture units 21-1 to 21-5, the number of image capture units is not limited thereto. That is, the image capture apparatus 1 b can be configured using a plurality of image capture units 21, such as six or more image capture units 21, or two or more image capture units 21.

FIG. 31 illustrates an example of an image capture range that can be captured by each of the image capture units 21-1, 21-2, 21-3, 21-4, and 21-5 of the image capture apparatus 1 b of the second embodiment. Each of the image capture units 21-1, 21-2, 21-3, 21-4, and 21-5 has a similar image capture range. In FIG. 31, the image capture range covered by the image capture units 21-1, 21-2, 21-3, 21-4, and 21-5 is represented by the image capture range of the image capture unit 21-1.

In the following description, as illustrated in FIG. 31, for example, the Z-axis is defined in the direction in which the imaging lenses 20 a-1, 20 a-2, . . . , and 20 a-5 are aligned, and the X-axis is defined in the direction of the optical axes of the imaging lenses 20 a-1 and 20 b-1. Further, the Y-axis is defined in the plane perpendicular to the Z-axis, intersecting the X-axis at a right angle.

The image capture unit 21-1 sets the center of the image capture unit 21-1 as the center of the full view spherical range using a combination of the imaging lenses 20 a-1 and 20 b-1 to set the image capture range. That is, as described above, the imaging lenses 20 a-1 and 20 b-1 have an angle of view of 180 degrees or more, preferably greater than 180 degrees, and more preferably 185 degrees or more. Therefore, by combining the imaging lenses 20 a-1 and 20 b-1, an image capture range A on the X-Y plane and an image capture range B on the X-Z plane can be set to 360 degrees, and the image capture range of the full view spherical image can be implemented by the combination of the imaging lenses 20 a-1 and 20 b-1.

Further, the image capture units 21-1 to 21-5 are arranged in the Z-axis direction, respectively, spaced apart equally from the adjacent image capture units by the distance “d.” Therefore, each set of hemispherical images, which are captured using each of the image capture units 21-1, 21-2, 21-3, 21-4, and 21-5 by setting the full view spherical range as the image capture range, becomes images having different viewpoints spaced apart by the distance “d” in the Z-axis direction.

In the second embodiment, the image capture operation using each of the imaging lenses 20 a-1 to 20 a-5 and the imaging lenses 20 b-1 to 20 b-5 is performed in a synchronized manner in response to the operation of the shutter button 30. Therefore, by using the image capture apparatus 1 b according to the second embodiment, it is possible to obtain five sets of paired hemispherical images captured at the same timing with different viewpoints spaced apart by the distance “d” in the Z-axis direction for each of the first surface and the second surface of the housing 10 b.

The five full view spherical images, each generated from each set of the paired hemispherical images captured at the same timing with the different viewpoints spaced apart by the distance “d” in the Z-axis direction, become images aligned along the same epipolar line extending in the Z-axis direction.

Image Processing of Second Embodiment:

Hereinafter, a description is given of the image processing according to the second embodiment. FIG. 32 is an example of a functional block diagram of the information processing apparatus 100 b used as an input apparatus for inputting the annotation according to the second embodiment. Since the hardware configuration of the information processing apparatus 100 a described with reference to FIG. 6 can be applied as the hardware configuration of the information processing apparatus 100 b, a detailed explanation thereof will be omitted. In FIG. 32, the same parts as those in FIG. 7 are denoted by the same reference numerals, and a detailed description thereof will be omitted.

Similar to the information processing apparatus 100 a described above, as illustrated in FIG. 32, the information processing apparatus 100 b includes, for example, an image acquisition unit 110, an image processing unit 111, an additional information generation unit 112, a UI unit 113, a communication unit 114, and an output unit 115. The information processing apparatus 100 b further includes, for example, a three-dimensional (3D) information generation unit 120.

The image acquisition unit 110 acquires each of the hemispherical images captured by the respective imaging lenses 20 a-1 to 20 a-5 and the respective imaging lenses 20 b-1 to 20 b-5 of the image capture apparatus 1 b. Based on each hemispherical image acquired by the image acquisition unit 110, the image processing unit 111 generates the five full view spherical images, each having a different viewpoint spaced apart by the distance “d” in the Z-axis direction and corresponding to each of the image capture units 21-1 to 21-5, by performing the processing described in FIGS. 8 to 12 and calculating the formulas (1) to (9).

Since the image capture apparatus 1 b according to the second embodiment assumes that the center line connecting the centers of the imaging lenses 20 a-1 to 20 a-5 is set parallel to the vertical direction, the inclination correction processing of the respective hemispherical images can be omitted. Alternatively, the inclination correction processing of the respective hemispherical images can be performed by disposing the acceleration sensor 2008 described above in the image capture apparatus 1 b, in which the vertical direction is detected based on a detection result of the acceleration sensor 2008, the inclination of the image capture apparatus 1 b with respect to the vertical direction is determined, and then the inclination correction processing is performed.
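A minimal sketch of deriving the inclination of the image capture apparatus 1 b from a static reading of a 3-axis acceleration sensor, which could drive such an inclination correction, is shown below; the axis convention and names are assumptions for illustration only.

import math

def inclination_from_acceleration(ax, ay, az):
    """Estimate roll and pitch (radians) from a static 3-axis accelerometer
    reading, treating the measured vector as gravity. Axis convention (an
    assumption): z points along the direction in which the lenses are aligned."""
    roll = math.atan2(ay, az)
    pitch = math.atan2(-ax, math.hypot(ay, az))
    return roll, pitch

# A device tilted slightly forward: gravity mostly on z, a little on x.
print(inclination_from_acceleration(0.17, 0.0, 9.8))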

The UI unit 113 displays each screen 500 described with reference to FIGS. 18, 20 to 23, and 26 to 29 based on the full view spherical image generated from a set of the hemispherical images, for example, the pair of images captured by the image capture unit 21-1, among the five full view spherical images respectively corresponding to each of the image capture units 21-1 to 21-5. The generation of annotation data by the additional information generation unit 112 is also performed based on the full view spherical image generated from the pair of hemispherical images captured by the image capture unit 21-1.

The 3D information generation unit 120 generates three-dimensional information using the five full view spherical images, each having a different viewpoint spaced apart by the distance “d” in the Z-axis direction, which are generated by the image processing unit 111.

Three-dimensional Information Generation Process of Second Embodiment:

Hereinafter, a description is given of a three-dimensional information generation process that is performed by the 3D information generation unit 120 of the second embodiment.

FIGS. 33A, 33B, 33C, 33D, 33E, and 33F (FIG. 33) illustrate examples of images that are captured from different viewpoints using the image capture apparatus 1 b and synthesized for each of the image capture units 21-1, 21-2, 21-3, 21-4, and 21-5. FIG. 33A illustrates an example of a diagnosis target 60. FIGS. 33B, 33C, 33D, 33E, and 33F illustrate an example of full view spherical images 300-1, 300-2, 300-3, 300-4, and 300-5, which are generated by capturing images of the same diagnosis target 60 from five different viewpoints and synthesizing the captured images. As illustrated in FIGS. 33B to 33F, each of the full view spherical images 300-1 to 300-5 includes an image of the diagnosis target 60 that is slightly shifted in accordance with the distance “d” set between the adjacent image capture units 21-1 to 21-5.

In FIG. 33, it is assumed that the image capture apparatus 1 b captures images of the diagnosis target 60 located at the first face side (front face side) of the image capture apparatus 1 b for the sake of description, but in an actual case, the image capture apparatus 1 b can capture images of the diagnosis target 60 surrounding the image capture apparatus 1 b. In this case, each of the full view spherical images 300-1 to 300-5 is, for example, an image obtained by using the equirectangular projection method, in which the left and right sides represent the same position, and the upper side and the lower side each represent one point, respectively. That is, the full view spherical images 300-1 to 300-5 illustrated in FIGS. 33B to 33F are images generated by the equirectangular projection method, converted, and then cut off partially.

Further, the projection method of the full view spherical images 300-1 to 300-5 is not limited to the equirectangular projection method. For example, if a larger angle of view in the Z-axis direction is not required for the full view spherical images 300-1 to 300-5, images using a cylindrical projection can be used.

FIG. 34 is an example of a flowchart illustrating the steps of creatinga three-dimensional reconstruction model of the second embodiment. Eachof the steps in this flowchart can be performed by the informationprocessing apparatus 100 b. Further, it is assumed that the imagecapture apparatus 1 b already stores ten hemispherical images capturedby the image capture units 21-1 to 21-5 in a memory disposed in theimage capture apparatus 1 b.

In step S10, the image acquisition unit 110 acquires each hemispherical image captured by each of the image capture units 21-1 to 21-5 from the image capture apparatus 1 b. The image processing unit 111 synthesizes each of the acquired paired hemispherical images for each of the image capture units 21-1 to 21-5 and generates the five full view spherical images captured from a plurality of viewpoints, as illustrated in FIG. 33.

In step S11, the 3D information generation unit 120 selects one of the full view spherical images 300-1 to 300-5 generated in step S10 as a reference full view spherical image; in this example, the full view spherical image 300-1 is used as the reference full view spherical image. Then, the 3D information generation unit 120 calculates the disparity of the other full view spherical images 300-2 to 300-5 with respect to the selected reference full view spherical image (i.e., the full view spherical image 300-1) for all pixels of the full view spherical images.

Hereinafter, a description is given of the principle of a disparity calculation method of the second embodiment. The basic principle of performing the disparity calculation using images captured by an image sensor, such as the image capture element 200 a, uses the method of triangular surveying. Hereinafter, a description is given of the triangular surveying with reference to FIG. 35. In FIG. 35, cameras 400 a and 400 b include lenses 401 a and 401 b and image capture elements 402 a and 402 b, respectively. A distance "D" from a line connecting the cameras 400 a and 400 b to a target 403 is calculated, using the triangular surveying, from the image capture positions of the target 403 in the images captured by each of the image capture elements 402 a and 402 b.

In FIG. 35, the value "f" indicates the focal length of each of the lenses 401 a and 401 b. Further, the length of the line connecting the optical axis centers of the lenses 401 a and 401 b is defined as a baseline length "B." The distance "d" between each of the image capture units 21-1 to 21-5 in the example case of FIG. 30 corresponds to the baseline length "B." The difference between the image capture positions "i₁" and "i₂" on the image capture elements 402 a and 402 b becomes the disparity "q" of the target 403. Since the relationship "D:f=B:q" is satisfied due to a triangle similarity relationship, the distance "D" can be calculated using the following formula (10).

$D = \dfrac{B \times f}{q} \qquad (10)$

Since the focal length "f" and the baseline length "B" are known in the formula (10), the remaining task is the calculation of the disparity "q." Since the disparity "q" corresponds to the difference between the image capture position "i₁" and the image capture position "i₂," detecting the correspondence relationship of the image capture positions in the images captured by the image capture elements 402 a and 402 b is the fundamental task of the disparity calculation. Typically, a matching process for finding a corresponding position between a plurality of images is implemented by searching each disparity on an epipolar line based on an epipolar constraint.
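
As an illustration of formula (10), the following minimal sketch (in Python; the function and parameter names are illustrative and not part of this specification) computes the distance D from a known baseline length, focal length, and disparity.

```python
# Minimal sketch of formula (10): distance from disparity by triangular surveying.
# baseline_b, focal_f, and disparity_q are illustrative names, not from the specification.
def distance_from_disparity(baseline_b: float, focal_f: float, disparity_q: float) -> float:
    """Return the distance D = (B * f) / q for a non-zero disparity q."""
    if disparity_q == 0:
        raise ValueError("zero disparity corresponds to a point at infinity")
    return (baseline_b * focal_f) / disparity_q

# Example: B = 0.02 (meters), f = 1200 (pixels), q = 8 (pixels) gives D = 3.0 meters.
# print(distance_from_disparity(0.02, 1200.0, 8.0))
```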

The searching of the disparity can be implemented using various computational methods. For example, a block matching process using Normalized Cross Correlation (NCC) indicated by the formula (11) can be applied, but is not limited thereto. For example, a high-density disparity calculation process using Semi-Global Matching (SGM) can also be applied. The method used for calculating the disparity can be selected appropriately depending on the application field of the three-dimensional reconstruction model that is ultimately generated. In the formula (11), the value "p" represents the pixel position, and the value "q" represents the disparity.

$\begin{matrix}{{C\left( {p,q} \right)}_{NCC} = \frac{\sum\limits_{j = 0}^{N - 1}\; {\sum\limits_{i = 0}^{M - 1}\; {{I\left( {i,j} \right)}{T\left( {i,j} \right)}}}}{\sqrt{\sum\limits_{j = 0}^{N - 1}\; {\sum\limits_{i = 0}^{M - 1}\; {{I\left( {i,j} \right)}^{2}{\sum\limits_{j = 0}^{N - 1}\; {\sum\limits_{i = 0}^{M - 1}\; {T\left( {i,j} \right)}^{2}}}}}}}} & (11)\end{matrix}$

Based on the cost function, the correspondence relationship of each pixel on the epipolar line is calculated, and the calculation result that is considered the most similar is selected. In NCC, that is, the formula (11), the pixel position where the value C(p,q)_(NCC) becomes the maximum can be regarded as the corresponding point. In SGM, the pixel position having the minimum cost can be regarded as the corresponding point.

Hereinafter, a description is given of an example of calculating the disparity using NCC of the formula (11). The block matching method acquires the pixel values of a region cut out as a pixel block of M pixels×N pixels centered on a reference pixel in a reference image, and the pixel values of a region cut out as a pixel block of M pixels×N pixels centered on a target pixel in a target image. Based on the acquired pixel values, the similarity between the region containing the reference pixel and the region containing the target pixel is calculated. The similarity is compared while shifting the block of M pixels×N pixels in the search target, and the target pixel in the block at the position where the similarity becomes the highest is regarded as the corresponding pixel of the reference pixel.

In the formula (11), the value I(i,j) represents the pixel values of pixels in the pixel block in the reference image, and the value T(i,j) represents the pixel values of pixels in the pixel block in the target image. The calculation of the formula (11) is performed while shifting the pixel block in the target image, corresponding to the pixel block of M pixels×N pixels in the reference image, by a unit of one pixel to search for the pixel position where the value C(p,q)_(NCC) becomes the maximum.
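
The block matching search described above can be summarized by the following sketch (Python with NumPy; rectified grayscale images, a horizontal search direction, and the function names are assumptions of this sketch), which evaluates the NCC of formula (11) for each candidate disparity and keeps the maximum.

```python
# Illustrative sketch of block matching with NCC (formula (11)) along one scanline.
import numpy as np

def ncc(block_i: np.ndarray, block_t: np.ndarray) -> float:
    # Formula (11): sum of products divided by the square root of the product of squared sums.
    num = np.sum(block_i * block_t)
    den = np.sqrt(np.sum(block_i ** 2) * np.sum(block_t ** 2))
    return float(num / den) if den > 0 else 0.0

def search_disparity(ref: np.ndarray, tgt: np.ndarray, y: int, x: int,
                     half: int = 3, max_q: int = 64) -> int:
    """Return the disparity q maximizing the NCC for the reference pixel (y, x)."""
    block_i = ref[y - half:y + half + 1, x - half:x + half + 1]
    best_q, best_c = 0, -1.0
    for q in range(max_q + 1):
        if x - half - q < 0:
            break  # candidate block would leave the target image
        block_t = tgt[y - half:y + half + 1, x - half - q:x + half + 1 - q]
        c = ncc(block_i, block_t)
        if c > best_c:
            best_q, best_c = q, c
    return best_q
```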

In a case of using the image capture apparatus 1 b according to the second embodiment, the disparity is basically calculated by using the principle of triangular surveying described above. The image capture apparatus 1 b includes, for example, the five image capture units 21-1 to 21-5, with which the five full view spherical images 300-1 to 300-5 can be captured. That is, the image capture apparatus 1 b can capture three or more images at the same time. Therefore, in the second embodiment, the above described principle of triangular surveying is extended and applied.

For example, as indicated in the formula (12), by calculating the sum, over the camera pairs spaced apart by the baseline length "B," of the cost with respect to the disparity "q," the corresponding points in the images captured by each camera can be detected.

$\sum\limits_{B} C_{NCC} \qquad (12)$

As one example, it is assumed that a first, a second, and a third camera are arranged on an epipolar line in the order of the first camera, the second camera, and the third camera. In this case, the cost calculation is performed using the above-described NCC or SGM for each of a set of the first camera and the second camera, a set of the first camera and the third camera, and a set of the second camera and the third camera. The distance "D" to the target can be calculated by calculating the sum of the costs calculated for each of the pairs of cameras as the total cost and then calculating the minimum value of the total cost.
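
One way to sketch this multi-camera extension is given below (Python; `pair_cost` is a hypothetical callback returning the matching cost of one camera pair at a given disparity, and the set of candidate distances is an assumption of this sketch). For each candidate distance D, the per-pair disparity q = B·f/D is evaluated and the pair costs are summed as in formula (12).

```python
# Hedged sketch of the multi-camera total cost of formula (12).
def total_cost(pairs, focal_f, candidate_d, pair_cost):
    """pairs: iterable of (pair_id, baseline_b); pair_cost(pair_id, q) returns that pair's cost."""
    total = 0.0
    for pair_id, baseline_b in pairs:
        q = int(round(baseline_b * focal_f / candidate_d))  # expected disparity for this baseline
        total += pair_cost(pair_id, q)
    return total

def best_distance(pairs, focal_f, candidates, pair_cost, maximize=True):
    # With NCC (a similarity) the best candidate has the maximum total;
    # with SGM-style costs, pass maximize=False to take the minimum instead.
    pick = max if maximize else min
    return pick(candidates, key=lambda d: total_cost(pairs, focal_f, d, pair_cost))
```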

The matching process described above can also be applied to the matching process in the automatic estimation function of the image capturing position described with reference to FIG. 15A.

Further, as a method of calculating the disparity, a stereo image measurement method using the epipolar plane image (EPI) can be applied. For example, the full view spherical images 300-1 to 300-5 generated from the images captured by each of the image capture units 21-1 to 21-5 can be regarded as images captured by a camera that moves at a constant velocity, and these images can be used to create the EPI. By using the EPI, the searching of the corresponding points between the full view spherical images 300-1 to 300-5 can be performed easily, for example, compared to the method using the triangular surveying described above.

For example, in the image capture apparatus 1 b, by setting the image capture unit 21-1 as the reference image capture unit, the distances d1-2, d1-3, d1-4, and d1-5 between the image capture unit 21-1 and each of the image capture units 21-2 to 21-5 are calculated. Based on the calculation results, a three-dimensional space image having the horizontal and vertical axes (x, y) of the full view spherical images 300-1 to 300-5 and the distance D (=0, d1-2, . . . , d1-5) is created. Then, a cross-section image on the y-D plane of the three-dimensional space image is created as the EPI.

In the EPI created as described above, the points on an object appearing in each of the original full view spherical images 300-1 to 300-5 are represented as a single straight line. The slope of the straight line changes in accordance with the distance from the image capture apparatus 1 b to the point on the object. Therefore, by detecting the straight line included in the EPI, the corresponding points between the full view spherical images 300-1 to 300-5 can be determined. Further, the distance from the image capture apparatus 1 b to the object corresponding to the straight line can be calculated based on the inclination of the straight line.

Hereinafter, a description is given of the principle of EPI with reference to FIGS. 36 and 37. FIG. 36A illustrates a set of a plurality of images 420-1, 420-2, . . . , each of which is a cylindrical image. FIG. 36B schematically illustrates an EPI 422 cut along a plane 421 from the set of images 420-1, 420-2, . . . . In the example of FIG. 36A, the image capture position axis of the images 420-1, 420-2, . . . is set in the depth direction, and the images 420-1, 420-2, . . . are superimposed on each other to generate three-dimensional data as illustrated in FIG. 36A. When the set of images 420-1, 420-2, . . . generated as the three-dimensional data is cut along the plane 421 parallel to the depth direction, the EPI 422 illustrated in FIG. 36B is generated.

In other words, the lines having the same X coordinate are extracted from each of the images 420-1, 420-2, . . . , and the extracted lines are arranged by using the indexes of the respective images 420-1, 420-2, . . . containing them as the new coordinate along the image capture position axis, to generate the EPI 422.
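
As a concrete illustration of this construction, the following sketch (Python with NumPy; the names are hypothetical) stacks the column at a fixed X coordinate from every image, ordered by capture position, into one EPI.

```python
# Minimal sketch of building an EPI from a stack of images captured at successive positions.
import numpy as np

def build_epi(images, x: int) -> np.ndarray:
    """images: sequence of H x W grayscale arrays ordered by capture position.
    Returns an EPI of shape (number of images, H): one row per capture position."""
    return np.stack([img[:, x] for img in images], axis=0)
```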

FIG. 37 is a diagram illustrating the principle of EPI of the second embodiment. FIG. 37A schematically illustrates the EPI 422 of FIG. 36B. In FIG. 37A, the horizontal axis "u" represents the depth direction in which the images 420-1, 420-2, . . . are superimposed, and indicates the disparity, while the vertical axis "v" represents the vertical axis of each of the images 420-1, 420-2, . . . . The EPI 422 is an image obtained by superimposing the captured images in the direction of the baseline length "B."

The change in the baseline length "B" is represented by the distance ΔX in FIG. 37B. In FIG. 37B, positions C1 and C2 respectively correspond to the optical centers of the lenses 401 a and 401 b in FIG. 35. In FIG. 37B, positions u1 and u2 are positions respectively defined with respect to the positions C1 and C2 set as the reference positions, and respectively correspond to the image capture positions "i₁" and "i₂" in FIG. 35.

When the images 420-1, 420-2, and so on are arranged along the direction of the baseline length "B," the positions of the corresponding points in the respective images 420-1, 420-2, and so on can be represented by a straight line having the inclination "m" or by a curved line on the EPI 422. The inclination "m" serves as the disparity "q" used for calculating the distance "D." The inclination "m" becomes smaller as the distance "D" becomes closer, and the inclination "m" becomes larger as the distance "D" becomes farther. The straight line or the curved line having the different inclination "m" depending on the distance "D" is referred to as a feature point locus.

The inclination "m" is represented by the following formula (13). In the formula (13), the value Δu is the difference between the position u1 and the position u2, each of which is an image capture point in FIG. 37B, and the value Δu can be calculated using the formula (14). The distance "D" can be calculated from the inclination "m" using the formula (15). In the formulas (13), (14), and (15), the value "v" represents the moving velocity of the camera, and the value "f" indicates the frame rate of the camera. That is, the formulas (13), (14), and (15) are calculation formulas for a case in which the omnidirectional images are captured at the frame rate "f" while moving the camera at the constant velocity "v."

$m = -\dfrac{\Delta v}{\Delta u} = \dfrac{-\Delta v}{-\dfrac{f\,\Delta X}{D}} = \dfrac{\Delta v}{f\,\Delta X}\,D \qquad (13)$

$\Delta u = u_{2} - u_{1} = \dfrac{fX}{D} - \dfrac{f\left(\Delta X + X\right)}{D} = \dfrac{-f\,\Delta X}{D} \qquad (14)$

$D = \dfrac{f\,\Delta X}{\Delta v}\,m \qquad (15)$
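
A minimal sketch applying the reconstructed formula (15) is given below (Python; the symbol names follow the formulas above, and treating ΔX as the camera displacement per frame is an assumption of this sketch, not a statement of the specification).

```python
def distance_from_inclination(m: float, frame_f: float, delta_x: float, delta_v: float) -> float:
    """Reconstructed formula (15): D = (f * delta_x / delta_v) * m.
    A larger inclination m of the feature point locus corresponds to a farther point."""
    return (frame_f * delta_x / delta_v) * m
```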

When the omnidirectional image is used as the image constituting the EPI, the inclination "m" takes a value based on a curve. A description is given with reference to FIGS. 38 and 39. In FIG. 38, spheres 411-1, 411-2, and 411-3 indicate full view spherical images captured by a camera #(n−1), a camera #ref, and a camera #0, respectively, each camera having the structure of the image capture unit 21 and being disposed on a straight line. The interval (baseline length) between the camera #0 and the camera #ref is set to the distance "d2," and the interval (baseline length) between the camera #ref and the camera #(n−1) is set to the distance "d1." Hereinafter, the spheres 411-1, 411-2, and 411-3 are referred to as the full view spherical images 411-1, 411-2, and 411-3, respectively.

The image capture position of a target point P on the full view spherical image 411-1 becomes a position having an angle φ_(n-1) relative to the straight line on which each of the cameras #0, #ref, and #(n−1) is disposed. Similarly, the image capture positions of the target point P on the full view spherical images 411-2 and 411-3 become positions respectively having an angle φ_(ref) and an angle φ₀ with respect to the straight line.

FIG. 39 is an example of a profile plotting the angles φ₀, φ_(ref), and φ_(n-1) on the vertical axis and plotting the positions of each of the cameras #0, #ref, and #(n−1) on the horizontal axis. As indicated in FIG. 39, the feature point locus, obtained by plotting the image capture position at each of the full view spherical images 411-1, 411-2, and 411-3 against the positions of the cameras #0, #ref, and #(n−1), does not become a straight line but becomes an approximated curve 413 based on the straight lines 412-1 and 412-2 connecting each of the points.

When calculating the disparity "q" of the entire circumference using the full view spherical images 300-1 to 300-5, a method of searching the corresponding points on the approximated curve 413 directly from the full view spherical images 300-1 to 300-5 can be used as described above, or a method of converting the full view spherical images 300-1 to 300-5 into images using the pinhole projection system and searching the corresponding points based on the converted images can be used.

In the example case of FIG. 38, the full view spherical image 411-2 is set as the reference image (ref) among the full view spherical images 411-1, 411-2, and 411-3, the full view spherical image 411-1 is set as the (n−1)th target image, and the full view spherical image 411-3 is set as the zero-th target image. Based on the full view spherical image 411-2, which is the reference image, the corresponding points of the full view spherical images 411-1 and 411-3 are respectively represented by the disparity q_(n-1) and the disparity q₀. The disparities q_(n-1) and q₀ can be determined by using various known techniques, such as the above formula (11).

By using the EPI to create the three-dimensional reconstruction model, a large amount of omnidirectional images can be uniformly processed. Further, by using the inclination "m," the calculation does not rely only on individual corresponding points, so that the processing becomes robust.

Returning to the flowchart of FIG. 34, the 3D information generation unit 120 proceeds to step S12 after the disparity calculation in step S11.

In step S12, the 3D information generation unit 120 performs correction processing on the disparity information indicating the disparity calculated in step S11. As to the correction processing of the disparity information, the correction based on the Manhattan-world hypothesis, the line segment correction, and the like can be applied.

In step S13, the 3D information generation unit 120 converts the disparity information corrected in step S12 into three-dimensional point group information.
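
The conversion of step S13 can be illustrated for the simpler pinhole-camera case by the sketch below (Python with NumPy; the spherical case would instead back-project along per-pixel viewing directions, and the parameter names are illustrative, not part of this specification).

```python
# Hedged sketch of converting a disparity map into a three-dimensional point group using D = B*f/q.
import numpy as np

def disparity_to_points(disparity: np.ndarray, baseline_b: float, focal_f: float,
                        cx: float, cy: float) -> np.ndarray:
    """Return an (N, 3) array of points for all pixels with a positive disparity."""
    ys, xs = np.nonzero(disparity > 0)               # ignore pixels with no valid disparity
    d = (baseline_b * focal_f) / disparity[ys, xs]   # distance D per pixel, formula (10)
    x = (xs - cx) * d / focal_f                      # back-project to the camera coordinate frame
    y = (ys - cy) * d / focal_f
    return np.column_stack([x, y, d])
```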

In step S14, the 3D information generation unit 120 performs one or more processes, such as smoothing processing and meshing processing, on the three-dimensional point group information converted from the disparity information in step S13, as needed. By performing the sequence of steps S10 to S14, the three-dimensional reconstruction model based on each of the full view spherical images 300-1 to 300-5 can be generated.

The sequence of steps S11 to S14 described above can be performed using structure-from-motion (SFM) software, multi-view stereo (MVS) software, and the like distributed as open source. The programs that are input to and operated on the information processing apparatus 100 b include, for example, the SFM software, the MVS software, and the like.

As described with reference to FIG. 31, the image capture apparatus 1 b includes each of the image capture units 21-1 to 21-5 disposed along the Z axis. Therefore, the distance from the image capture apparatus 1 b to a target object is preferentially calculated in the radial direction on a plane 40 (see FIG. 40), which is orthogonal to the direction aligning each of the imaging lenses 20 a-1 to 20 a-5. Here, "preferentially" refers to the capability of generating the three-dimensional reconstruction model with respect to the angle of view.

That is, as to each direction (radial direction in FIG. 40) on the plane 40, the angle of view of each of the image capture units 21-1 to 21-5 can include the entire circumference of 360 degrees, and the distance can be calculated for the entire circumference. On the other hand, in the Z-axis direction, the overlapping portions of the angle of view (image capture range) of each of the image capture units 21-1 to 21-5 are increased. Therefore, in the Z-axis direction, the disparity becomes smaller among the full view spherical images 300-1 to 300-5 generated from the images captured by each of the image capture units 21-1 to 21-5 near the angle of view of 180 degrees. Therefore, it is difficult to calculate the distance around the entire circumference of 360 degrees in the direction of the plane including the Z axis.

As illustrated in FIG. 41, a large space including large buildings 50, 50, 50, and so on is considered as a target of creating the three-dimensional reconstruction model. In FIG. 41, the X-axis, Y-axis, and Z-axis correspond to the X-axis, Y-axis, and Z-axis illustrated in FIG. 31. In this example case, the modeling for the total angle of view (360 degrees) is performed in the direction of the plane 40 represented by the X-Y axes. Therefore, it is preferable to arrange the plurality of image capture units 21-1 to 21-5 in the Z-axis direction that is orthogonal to the plane 40.

Configuration of Signal Processing of Image Capture Apparatus of Second Embodiment:

Hereinafter, a description is given of a configuration for signal processing of the image capture apparatus 1 b according to the second embodiment with reference to FIG. 42. FIG. 42 is an example of a hardware block diagram of the image capture apparatus 1 b according to the second embodiment. In FIG. 42, portions corresponding to FIG. 30 are denoted by the same reference numerals, and a detailed description thereof will be omitted.

As illustrated in FIG. 42, the image capture apparatus 1 b includes, for example, image capture elements 200 a-1, 200 a-2, . . . , and 200 a-5, image capture elements 200 b-1, 200 b-2, . . . , and 200 b-5, drive units 210 a-1, 210 a-2, . . . , and 210 a-5, drive units 210 b-1, 210 b-2, . . . , and 210 b-5, buffer memories 211 a-1, 211 a-2, . . . , and 211 a-5, and buffer memories 211 b-1, 211 b-2, . . . , and 211 b-5.

The image capture elements 200 a-1, 200 a-2, . . . , and 200 a-5, the drive units 210 a-1, 210 a-2, . . . , and 210 a-5, and the buffer memories 211 a-1, 211 a-2, . . . , and 211 a-5 respectively correspond to the imaging lenses 20 a-1, 20 a-2, . . . , and 20 a-5, and are included in the image capture units 21-1, 21-2, . . . , and 21-5. In FIG. 42, only the image capture unit 21-1 among the image capture units 21-1 to 21-5 is illustrated in order to avoid complexity.

Similarly, the image capture elements 200 b-1, 200 b-2, . . . , and 200 b-5, the drive units 210 b-1, 210 b-2, . . . , and 210 b-5, and the buffer memories 211 b-1, 211 b-2, . . . , and 211 b-5 respectively correspond to the imaging lenses 20 b-1, 20 b-2, . . . , and 20 b-5, and are included in the image capture units 21-1, 21-2, . . . , and 21-5.

The image capture apparatus 1 b further includes, for example, a control unit 220, a memory 221, and a switch (SW) 222. The switch 222 corresponds to the shutter button 30 illustrated in FIG. 30. For example, when the switch 222 is in the closed state, the shutter button 30 is in an operated state.

Hereinafter, the image capture unit 21-1 is described. The image capture unit 21-1 includes, for example, the image capture element 200 a-1, the drive unit 210 a-1, the buffer memory 211 a-1, the image capture element 200 b-1, the drive unit 210 b-1, and the buffer memory 211 b-1.

Since the drive unit 210 a-1 and the image capture element 200 a-1, and the drive unit 210 b-1 and the image capture element 200 b-1 are equivalent to the drive unit 210 a and the image capture element 200 a, and the drive unit 210 b and the image capture element 200 b described with reference to FIG. 4, a detailed explanation thereof will be omitted here.

The buffer memory 211 a-1 is a memory capable of storing the captured image for at least one frame. The captured image output from the drive unit 210 a-1 is temporarily stored in the buffer memory 211 a-1.

Since the functions of the image capture units 21-2 to 21-5 are equivalent to those of the image capture unit 21-1, the description thereof will be omitted here.

The control unit 220 controls the overall operation of the image capture apparatus 1 b. When the control unit 220 detects a transition of the switch 222 from the open state to the closed state, the control unit 220 outputs trigger signals. The trigger signals are simultaneously supplied to each of the drive units 210 a-1, 210 a-2, . . . , and 210 a-5, and each of the drive units 210 b-1, 210 b-2, . . . , and 210 b-5.

Under the control of the control unit 220 in accordance with the output of the trigger signals, the memory 221 reads each captured image from each of the buffer memories 211 a-1, 211 a-2, . . . , and 211 a-5, and each of the buffer memories 211 b-1, 211 b-2, . . . , and 211 b-5, and stores each of the captured images. Each of the captured images stored in the memory 221 can be read by the information processing apparatus 100 b connected to the image capture apparatus 1 b.

The battery 2020 is, for example, a secondary battery, such as a lithium ion secondary battery, and is used as a power supply unit for supplying electric power to each unit of the image capture apparatus 1 b that needs to be supplied with power. The battery 2020 includes, for example, a charge/discharge control circuit for controlling charge and discharge of the secondary battery.

FIG. 43 is an example of a hardware block diagram of the control unit 220 and the memory 221 of the image capture apparatus 1 b according to the second embodiment. In FIG. 43, the same reference numerals are assigned to the same parts as those in FIG. 4, and detailed descriptions thereof will be omitted.

As illustrated in FIG. 43, the control unit 220 includes, for example, a CPU 2000, a ROM 2001, a trigger I/F 2004, a switch (SW) circuit 2005, a data I/F 2006, and a communication I/F 2007, which are communicatively connected to a bus 2010. The memory 221 includes, for example, a RAM 2003 and a memory controller 2002, and the memory controller 2002 is connected to the bus 2010. The battery 2020 supplies power to the CPU 2000, the ROM 2001, the memory controller 2002, the RAM 2003, the trigger I/F 2004, the switch circuit 2005, the data I/F 2006, the communication I/F 2007, and the bus 2010, respectively.

The memory controller 2002 controls data storage and reading to and from the RAM 2003 in accordance with instructions of the CPU 2000. In accordance with the instructions of the CPU 2000, the memory controller 2002 also controls reading of the captured images from each of the buffer memories 211 a-1, 211 a-2, . . . , and 211 a-5, and each of the buffer memories 211 b-1, 211 b-2, . . . , and 211 b-5.

The switch circuit 2005 detects a transition of the switch 222 between the closed state and the open state and transfers a detection result to the CPU 2000. When the CPU 2000 receives, from the switch circuit 2005, the detection result indicating that the switch 222 has transitioned from the open state to the closed state, the CPU 2000 outputs trigger signals. The trigger signals are output via the trigger I/F 2004 and supplied to each of the drive units 210 a-1, 210 a-2, . . . , and 210 a-5, and each of the drive units 210 b-1, 210 b-2, . . . , and 210 b-5.

The CPU 2000 outputs the trigger signals in accordance with the detection result of the switch circuit 2005, but the configuration is not limited thereto. For example, the CPU 2000 can be configured to output the trigger signals in accordance with a signal supplied via the data I/F 2006 or the communication I/F 2007. Further, the trigger I/F 2004 can be configured to generate the trigger signals in accordance with the detection result of the switch circuit 2005 and supply the trigger signals to each of the drive units 210 a-1, 210 a-2, . . . , and 210 a-5, and each of the drive units 210 b-1, 210 b-2, . . . , and 210 b-5.

In this configuration, when the control unit 220 detects a transition of the switch 222 from the open state to the closed state, the control unit 220 generates and outputs the trigger signals. Then, the trigger signals are supplied at the same timing to each of the drive units 210 a-1, 210 a-2, . . . , and 210 a-5, and each of the drive units 210 b-1, 210 b-2, . . . , and 210 b-5. In synchronization with the supplied trigger signals, each of the drive units 210 a-1, 210 a-2, . . . , and 210 a-5, and each of the drive units 210 b-1, 210 b-2, . . . , and 210 b-5 receives electric charges from each of the image capture elements 200 a-1, 200 a-2, . . . , and 200 a-5, and each of the image capture elements 200 b-1, 200 b-2, . . . , and 200 b-5, respectively.

Then, each of the drive units 210 a-1, 210 a-2, . . . , and 210 a-5, and each of the drive units 210 b-1, 210 b-2, . . . , and 210 b-5, respectively, converts the electric charges received from each of the image capture elements 200 a-1, 200 a-2, . . . , and 200 a-5, and each of the image capture elements 200 b-1, 200 b-2, . . . , and 200 b-5 into captured image data, and stores each of the captured image data in each of the buffer memories 211 a-1, 211 a-2, . . . , and 211 a-5, and each of the buffer memories 211 b-1, 211 b-2, . . . , and 211 b-5, respectively.

At a given timing after outputting the trigger signals, the control unit 220 instructs the memory 221 to read the captured image data from each of the buffer memories 211 a-1, 211 a-2, . . . , and 211 a-5, and each of the buffer memories 211 b-1, 211 b-2, . . . , and 211 b-5. In accordance with this instruction, in the memory 221, the memory controller 2002 reads each of the captured image data from each of the buffer memories 211 a-1, 211 a-2, . . . , and 211 a-5, and each of the buffer memories 211 b-1, 211 b-2, . . . , and 211 b-5, and stores each of the captured image data in a given area of the RAM 2003.

If the information processing apparatus 100 b is connected to the image capture apparatus 1 b via, for example, the data I/F 2006, the information processing apparatus 100 b requests the image capture apparatus 1 b, via the data I/F 2006, to read each of the captured image data (e.g., hemispherical images) stored in the RAM 2003. In response to this request, the CPU 2000 of the image capture apparatus 1 b instructs the memory controller 2002 to read each of the captured image data from the RAM 2003. In response to this instruction, the memory controller 2002 reads each of the captured image data from the RAM 2003 and transmits each of the captured image data to the information processing apparatus 100 b via the data I/F 2006. Then, the information processing apparatus 100 b performs the sequence in accordance with the flowchart of FIG. 34 based on each of the captured image data transmitted from the image capture apparatus 1 b.

Similar to the image capture apparatus 1 a according to the first embodiment described with reference to FIG. 5, in the image capture apparatus 1 b according to the second embodiment, the battery 2020 and the circuit unit 2030 are provided inside the housing 10 b. As to the battery 2020 and the circuit unit 2030, at least the battery 2020 is fixed inside the housing 10 b by fixing means, such as an adhesive or a screw. The circuit unit 2030 includes, for example, at least each of the above described units, such as the control unit 220 and the memory 221. The control unit 220 and the memory 221 are configured on, for example, one or more circuit boards. In the image capture apparatus 1 b, the battery 2020 and the circuit unit 2030 are arranged on a portion extended from the imaging lenses 20 a-1, 20 a-2, . . . , and 20 a-5 arranged in an aligned manner.

The battery 2020 and the circuit unit 2030 can be arranged at the given positions as described above, but are not limited thereto. For example, if the circuit unit 2030 is sufficiently small, at least the battery 2020 alone may be disposed at the given position.

By arranging the battery 2020 and the circuit unit 2030 in this manner, the width of the faces (front and rear faces) where the imaging lenses 20 a-1, 20 a-2, . . . , and 20 a-5 (and the imaging lenses 20 b-1, 20 b-2, . . . , and 20 b-5) of the image capture apparatus 1 b are disposed can be reduced. As a result, it is possible to reduce the inclusion of an image of a portion of the housing 10 b of the image capture apparatus 1 b in each of the image data captured by each of the image capture units 21-1 to 21-5, with which the disparity can be calculated with higher accuracy. For the same reason, it is preferable to set the width of the housing 10 b of the image capture apparatus 1 b as small as possible. This is the same for the image capture apparatus 1 a according to the first embodiment.

Annotation Input Method of Second Embodiment:

Hereinafter, a description is given of an annotation input method according to the second embodiment. As described above, in the information processing apparatus 100 b, the UI unit 113 displays a screen used for inputting the annotation using the full view spherical image generated from the hemispherical images captured, for example, by one image capture unit (e.g., the image capture unit 21-1) among the five image capture units 21-1 to 21-5 of the image capture apparatus 1 b. That is, the UI unit 113 cuts out an image of a part of the full view spherical image and displays the cut-out image on the screen 500, for example, as the partial image 5010 as illustrated in FIG. 20.

In this case, the UI unit 113 acquires the three-dimensional point group information generated in accordance with the flowchart of FIG. 34 from the 3D information generation unit 120. The UI unit 113 can switch the display of the screen 500 between the partial image 5010 and the three-dimensional point group information, for example, in accordance with an operation on the menu button 5050. The UI unit 113 displays each point included in the three-dimensional point group information using different colors, for example, in accordance with the distance of each point.

Further, in the second embodiment, the annotation input process is performed in accordance with the flowchart of FIG. 16 described above. The position designation in step S103 in the flowchart of FIG. 16 is performed using three-dimensional coordinates based on the three-dimensional point group information.

FIGS. 44A and 44B (FIG. 44) are an example of diagrams for describing a position designation according to the second embodiment. As illustrated in FIG. 44A, it is assumed that objects 700-1, 700-2, and 700-3 having three-dimensional structures are arranged in a three-dimensional space represented by the X-axis, the Y-axis, and the Z-axis, which are orthogonal to each other, and the image capture apparatus 1 b performs the image capture operation on the objects 700-1, 700-2, and 700-3. The three-dimensional point group information generated from the full view spherical images acquired by the image capture operation of the image capture apparatus 1 b includes, for example, three-dimensional information at each point on a face of each of the objects 700-1, 700-2, and 700-3 facing the image capture position.

FIG. 44B is an example of the partial image 5010 cut from the full view spherical image captured by the image capture unit 21-1 of the image capture apparatus 1 b in the condition of FIG. 44A. The UI unit 113 displays the partial image 5010 illustrated in FIG. 44B on the screen 500. Then, it is assumed that a user designates positions of points on the objects 700-1 and 700-2 using, for example, markers 602 ax and 602 bx on the partial image 5010 illustrated in FIG. 44B.

In the second embodiment, the three-dimensional point group information of the space including the objects 700-1 to 700-3, captured by the image capture apparatus 1 b, can be acquired. Therefore, the position designation is performed at the positions indicated by the three-dimensional coordinates on each of the objects 700-1 and 700-2, as indicated by the markers 602 ax and 602 bx in FIG. 44A.

Further, when the position designation is performed by using the three-dimensional coordinates based on the three-dimensional point group information, it is possible to calculate the length of a line or the area of a polygon, for example, when the position is designated by the line or the polygon, as described above in the variant example according to the first embodiment.

FIG. 45 is an example of a diagram for describing a position designation using a line according to the second embodiment. Similar to FIG. 44A, in FIG. 45, objects 700-4 and 700-5 having three-dimensional structures are arranged in a three-dimensional space represented by the X-axis, the Y-axis, and the Z-axis, which are orthogonal to each other. Further, a crack 610 a is observed on the column face of the object 700-4 having a cylindrical shape, in which the crack 610 a extends along the circumferential direction of the column face. Further, a crack 610 b is observed across two side faces of the object 700-5 having a rectangular parallelepiped shape (square column).

It is difficult to calculate the length of the crack 610 a and the length of the crack 610 b, each having depth, from a two-dimensional captured image. For example, as to the crack 610 included in the partial image 5010 illustrated in FIG. 26 in the first embodiment, the length of the crack 610 can be measured if the wall surface where the crack 610 exists is parallel to the face of the partial image 5010. By contrast, in the second embodiment, since the three-dimensional point group information of the objects 700-4 and 700-5 can be obtained, the lengths of the cracks 610 a and 610 b having the depth information can be easily calculated. This is the same for the calculation of the area of a polygon, the perimeter of a polygon, and the like.
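
For example, when a crack is designated as a sequence of points, its length can be obtained by summing the segment lengths between consecutive designated three-dimensional positions, as in the following sketch (Python with NumPy; the names and values are illustrative and not part of this specification).

```python
# Illustrative sketch of measuring a crack designated as a polyline of 3-D positions.
import numpy as np

def polyline_length(points_3d: np.ndarray) -> float:
    """points_3d: (N, 3) array of designated positions along the crack, in order."""
    return float(np.sum(np.linalg.norm(np.diff(points_3d, axis=0), axis=1)))

# Example: an L-shaped crack across two faces of a square column (0.5 m + 0.3 m = 0.8 m).
# polyline_length(np.array([[0.0, 0.0, 0.0], [0.5, 0.0, 0.0], [0.5, 0.3, 0.0]]))
```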

As to the above described embodiments, a task of associating information of a target and position information of the target on an image can be easily performed.

Numerous additional modifications and variations are possible in light of the above teachings. It is therefore to be understood that, within the scope of the appended claims, the disclosure of this patent specification can be practiced otherwise than as specifically described herein. Any one of the above-described operations may be performed in various other ways, for example, in an order different from the one described above.

Each of the functions of the above-described embodiments can be implemented by one or more processing circuits or circuitry. Processing circuitry includes a programmed processor, as a processor includes circuitry. A processing circuit also includes devices such as an application specific integrated circuit (ASIC), digital signal processor (DSP), field programmable gate array (FPGA), system on a chip (SOC), graphics processing unit (GPU), and conventional circuit components arranged to perform the recited functions.

What is claimed is:
 1. An input apparatus for inputting a diagnosis result of a diagnosis target detectable for a structure, comprising: circuitry configured to display a spherical image captured for the structure on a screen; receive an input of a position of the diagnosis target in the spherical image; store position information indicating the received position of the diagnosis target in the spherical image in a memory; display, on the screen, the spherical image and a diagnosis information input field used for inputting diagnosis information of the diagnosis target; receive an input of the diagnosis information of the diagnosis target, input via the diagnosis information input field; and store the diagnosis information and the position information indicating the received position of the diagnosis target in the spherical image, in the memory in association with each other.
 2. The input apparatus according to claim 1, wherein the circuitry superimposes, on the spherical image, an image indicating the diagnosis target in accordance with the position information.
 3. The input apparatus according to claim 1, wherein the circuitry receives an input of a diagnosis region including the diagnosis target on the spherical image, and stores the position information corresponding to the received diagnosis region and the diagnosis information of the diagnosis target in the memory in association with each other.
 4. The input apparatus according to claim 3, wherein the circuitry displays an image in the diagnosis region on the spherical image, in the diagnosis information input field.
 5. The input apparatus according to claim 1, wherein the circuitry displays an image different from the spherical image in the diagnosis information input field.
 6. The input apparatus according to claim 1, wherein the circuitry displays the diagnosis information input field by superimposing the diagnosis information input field on a part of the spherical image.
 7. The input apparatus according to claim 1, wherein when the circuitry receives an input of a group of positions including a plurality of positions defining the diagnosis target in the spherical image, the circuitry stores position information indicating the received group of positions in the memory.
 8. The input apparatus according to claim 1, wherein the circuitry displays a drawing indicating a configuration of the structure and the spherical image on one screen.
 9. The input apparatus according to claim 1, wherein the spherical image is a three-dimensional image having three-dimensional information, and the circuitry receives the input of the position having three-dimensional information indicating the diagnosis target.
 10. A method of inputting a diagnosis result of a diagnosis target detectable for a structure, the method comprising: displaying a spherical image captured for the structure on a screen; receiving an input of a position of the diagnosis target in the spherical image; storing position information indicating the received position in a memory; displaying, on the screen, the spherical image and a diagnosis information input field used for inputting diagnosis information of the diagnosis target; receiving an input of the diagnosis information of the diagnosis target, input via the diagnosis information input field; and storing the diagnosis information and the position information in the memory in association with each other.
 11. An output apparatus for outputting a diagnosis result of a diagnosis target detectable for a structure, comprising: circuitry configured to acquire position information indicating a position of the diagnosis target in a spherical image, captured for the structure, and diagnosis information including a diagnosis result of the diagnosis target stored in a memory in association with each other, from the memory; and output the acquired diagnosis information of the diagnosis target based on the position information associated with the diagnosis information.
 12. The output apparatus according to claim 11, wherein the memory stores an image based on the spherical image in association with the position information, wherein the circuitry acquires the image based on the spherical image from the memory, and outputs the acquired image together with the diagnosis information based on the position information.
 13. The output apparatus according to claim 11, wherein the circuitry acquires the spherical image, and displays, on a screen, the spherical image together with information indicating the diagnosis information associated with the position information on the spherical image, by superimposing the information indicating the diagnosis information at a position indicated by the position information on the spherical image.
 14. The output apparatus according to claim 11, wherein the circuitry receives an input of designating information indicating the diagnosis information to be superimposed and displayed on the spherical image, and outputs the diagnosis information designated by the input received by the circuitry. 